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The invention was made with government support under grant number CA64394 
awarded by the National Institutes of Health. The government has certain rights in the 
invention. 



1. INTRODUCTION 

1 5 The present invention relates to the use of alpha (2) macroglobulin ("a2M") receptor 

as a heat shock protein receptor, cells that express the a2M receptor bound to an HSP, and 
antibodies and other molecules that bind the <x2M receptor-HSP complex. The invention 
also relates to screening assays to identify compounds that modulate the interaction of an 
HSP with the a2M receptor, and methods for using compositions comprising a2M-receptor 

20 sequences for the diagnosis and treatment of immune disorders, proliferative disorders, and 
infectious diseases. 



2. BACKGROUND OF THE INVENTION 

2.1 . HEAT SHOCK PROTEINS 

Heat shock proteins (HSPs), also referred to as stress proteins, were first identified 
as proteins synthesized by cells in response to heat shock. Hsps have classified into five 
families, based on molecular weight, HsplOO, Hsp90, Hsp7Q, Hsp60, and smHsp. Many 
members of these families were found subsequently to be induced in response to other 
stressful stimuli including nutrient deprivation, metabolic disruption, oxygen radicals, and 
infection with intracellular pathogens (see Welch, May 1993, Scientific American 56-64; 
Young, 1990, Annu. Rev. Immunol. 8:401-420; Craig, 1993, Science 260:1902-1903; 
Gething et aL, 1992, Nature 355:33-45; and Lindquist et al, 1988, Annu. Rev. Genetics 
22:631-677). 

Heat shock proteins are among the most highly conserved proteins in existence. For 
example, DnaK, the Hsp70 from E. coli has about 50% amino acid sequence identity with 
Hsp70 proteins from excoriates (Bardwell et aL, 1984, Proc. Natl. Acad. Sci. 81:848-852).' 
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The Hsp60 and Hsp90 families also show similarly high levels of intra-family conservation 
(Hickey et al, 1989, MoL Cell. Biol 9:2615-2626; Jindal, 1989, Mol. Cell. Biol. 9:2279- 
2283). In addition, it has been discovered that the Hsp60, Hsp70 and Hsp90 families are 
composed of proteins that are related to the stress proteins in sequence, for example, having 

5 greater than 35% amino acid identity, but whose expression levels are not altered by stress. 
Studies on the cellular response to heat shock and other physiological stresses 
revealed that the HSPs are involved not only in cellular protection against these adverse 
conditions, but also in essential biochemical and immunological processes in unstressed 
cells. HSPs accomplish different kinds of chaperoning functions. For example, members 

10 of the Hsp70 family, located in the cell cytoplasm, nucleus, mitochondria, or endoplasmic 
reticulum (Lindquist et al, 1988, Ann. Rev. Genetics 22:631-677), are involved in the 
presentation of antigens to the cells of the immune system, and are also involved in the 
transfer, folding and assembly of proteins in norma! cells. HSPs are capable of binding 
proteins or peptides, and releasing the bound proteins or peptides in the presence of 

1 5 adenosine triphosphate (ATP) or low pH. 

2.2. BMMUNOGENICITY OF HSP-PEPTIDE COMPLEXES 

Srivastava et al demonstrated immune response to methylcholanthrene-induced 
sarcomas of inbred mice (1988, Immunol. Today 9:78-83). In these studies, it was found 

20 that the molecules responsible for the individually distinct inununogenicity of these tumors 
were glycoproteins of 96kDa (gp96) and intracellular proteins of 84 to 86kDa (Srivastava et 
al, 1986, Proc. Natl. Acad. Sci. USA 83:3407-341 1; Ullrich et al, 1986, Proc. Natl. Acad. 
Sci. USA 83:3121-3125). Immunization of mice with gp96 or p84/86 isolated from a 
particular tumor rendered the mice immune to that particular tumor, but not to antigenically 

25 distinct tumors. Isolation and characterization of genes encoding gp96 and p84/86 revealed 
significant homology between them, and showed that gp96 and p84/86 were, respectively, 
the endoplasmic reticular and cytosolic counterparts of the same heat shock proteins 
(Srivastava et al, 1988, Immunogenetics 28:205-207; Srivastava et al, 1991, Curr. Top. 
Microbiol. Immunol. 167:109-123). Further, Hsp70 was shown to elicit immunity to the 

30 tumor from which it was isolated but not to antigenically distinct tumors. However, Hsp70 
depleted of peptides was found to lose its immunogenic activity (Udono and Srivastava, 
1993, J. Exp. Med. 178:1391-1396). These observations suggested that the heat shock 
proteins are not immunogenic per se, but form noncovalent complexes with antigenic 
peptides^ and the complexes can elicit specific immunity to the antigenic peptides 

35 (Srivastava, 1993, Adv. Cancer Res. 62:153-177; Udono et al, 1994, J. Immunol., 
152:5398-5403; Suto etal, 1995, Science, 269:1585-1588). 
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Noncovalent complexes of HSPs and peptide, purified from cancer cells, can be used 
for the treatment and prevention of cancer and have been described in PCT publications WO 
96/1041 1, dated April 11, 1996, and WO 97/10001, dated March 20, 1997 (U.S. Patent No. 
5,750,119 issued April 12, 1998, and U.S. Patent No. 5,837,251 issued November 17, 1998, 

5 respectively, each of which is incorporated by reference herein in its entirety). The isolation 
and purification of stress protein-peptide complexes has been described, for example, from 
pathogen-infected cells, and can be used for the treatment and prevention of infection caused 
by the pathogen, such as viruses, and other intracellular pathogens, including bacteria, 
protozoa, fungi and parasites (see, for example, PCT Publication WO 95/24923, dated 

10 September 21, 1995). Immunogenic stress protein-peptide complexes can also be prepared 
by in vitro complexing of stress protein and antigenic peptides, and the uses of such 
complexes for the treatment and prevention of cancer and infectious diseases has been 
described in PCT publication WO 97/10000, dated March 20, 1997 (U.S. Patent No. 
6,030,618 issued February 29, 2000. The use of stress protein-peptide complexes for 

15 sensitizing antigen presenting cells in vitro for use in adoptive immunotherapy is described 
in PCT publication WO 97/10002, dated March 20, 1997 (see also U.S. Patent No. 5,985,270 
issued November 16, 1999). 



2.3. ALPHA (2) MACROGLOBUUN RECEPTOR 

20 The alpha (2) macroglobulin receptor (herein referred to interchangeably as either 

"a2MR" or "the a2M receptor"), also known as LDL (low-density lipoprotein) receptor- 
Related Protein ("LRP") or CD91, is primarily expressed in liver, brain and placenta. The 
a2M receptor is a member of the low density lipoprotein receptor family. The extracellular 
domain of the human receptor comprises six 50-amino acid EGF repeats and 31 complement 

25 repeats of approximately 40-42 amino acids. The complement repeats are organized, from 
the amino to the carboxy-terminus, into clusters of 2, 8, 10 and 11 repeats, called Cluster I, 
H, HI and IV (Hera et aL 9 1988, EMBO J. 7:41 19-4127). One study points to Cluster H (Cl- 
II), which contains complement repeats 3-10 (CR3-10), as the major ligand binding portion 
of the receptor (Horn et aL, 1997, J. Biol. Chem. 272:13608-13613). The a2M receptor 

30 plays a role in endocytosis of a diversity of ligands. In addition to <x2M, other ligands of 
<x2MR include lipoprotein complexes, lactoferrin, tissue-type plasminogen activator (tPA), 
urokinase-type plasminogen activator (uPA), and exotoxins. Thus, the a2M receptor plays 
roles in a variety of cellular processes, including endocytosis, antigen presentation, 
cholesterol regulation, ApoE-containing lipoprotein clearance, and chylomicron remnant 

35 removal. 
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Human cx2M is synthesized as a 1474 amino acid precursor, the first 23 of which 
function as a signal sequence that is cleaved to yield a 1451 amino acid mature protein (Kan 
et al., 1985, Proc. Natl. Acad. Sci. U.S. A. 82:2282-2286). In experiments with recombinant 
protein, the carboxy-teiminal 138 amino acids of a2M (representing amino acids 1314-1451 

5 of the mature protein) was found to bind the receptor. This domain has been called the RBD 
(receptor-binding domain; Salvesent et al, 1992, FEBS Lett. 313:198-202; Holtet et al, 
1994, FEBS Lett. 344:242-246). An RBD variant (RBDv), a proteolytic fragment of a2M 
comprising an additional 15 amino terminal residues (representing amino acids 1314-1451 of 
the mature protein) binds to the receptor with almost the same affinity as a2M-proteinase 

10 (Holtet et al, 1994, FEBS Lett. 344:242-246). 

Alignment of a2MR ligands identifies a conserved domain present in the RBDs of a 
macroglobulins. The conserved sequence spans amino acids 1366-1392 of human a2M. 
Conserved residues within this domain are Phe 1366 , Leu 1369 , Lys 1370 , Val 1373 , Lys ]374 , Glu 3377 , 
Val 1382 , Arg 1384 (Nielsen et al, 1996, J. Biol. Chem. 271 :12909-12912). Of these, Lys 1370 and 

15 Lys 1374 were shown to be critical for receptor binding (Nielsen et al, 1996, J. Biol. Chem. 
271:12909-12912). 

Binding of ligands, including the binding to a2M, to a2MR is inhibited by a2MR- 
associated protein (RAP). RAP is a 39 kDa folding chaperone that resides in the 
endoplasmic reticulum and is required for the normal processing of a2MR. RAP has the 

20 ability to competitively inhibit the binding of all <x2MR to all <x2MR ligands tested. One 
study shows RAP to bind to complement repeats C5-C7 in cluster II (Cl-II) of cx2MR (Horn 
et al., 1997, J. Biol. Chem. 272:13608-13613); another shows RAP to bind to all two 
complement repeat-modules in Cl-II except the C9-C10 module (Andersen et al, J. Biol 
Chem., Mar. 24, 2000, PMID: 10747921; published electronically ahead of print). Three 

25 structural domains, 1, 2 and 3, have been identified in RAP, consisting of amino acid 
residues 18-112, 113-218 and 219-323, respectively. Ligand competition titration of 
recombinant RAP domains indicates that determinants for the inhibition of test ligands 
reside in the C-terminal regions of domains 1 and 3 (Ellgaard et al, 1997, Eur. J. Biochem. 
244:544-51). 

30 

2.4. ANTIGEN PRESENTATION 

Major histocompatibility complex (MHC) molecules present antigens on the cell 
surface of antigen-presenting cells. Cytotoxic T lymphocytes (CTLs) then recognize MHC 
molecules and their associated peptides and kill the target cell. Antigens are processed by 
35 two distinct antigen processing routes depending upon whether their origin is intracellular or 
extracellular. Intracellular or endogenous protein antigens, i.e., antigens synthesized within 
the antigen-presenting cell, are presented by MHC class I (MHC I) molecules to CD8+ 

-4- 
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cytotoxic T lymphocytes. On the other hand, extracellular or exogenously synthesized 
antigenic determinants are presented on the cell surface of "specialized" or "professional" 
APCs (macrophages, for example) by MHC class II molecules to CD4+ T cells (see, 
generally, Fundamental Immunology, W.E. Paul (ed.), New York: Raven Press, 1984). This 

5 compartmental segregation of antigen processing routes is important to prevent tissue 

destruction that could otherwise occur during an immune response as a result of shedding of 
neighboring cell MHC I antigens. 

The heat shock protein gp96 chaperones a wide array of peptides, depending upon the 
source from which gp96 is isolated (for review, see Srivastava et al, 1998, Immunity 8: 657- 

10 665). Tumor-derived gp96 carries tumor-antigenic peptides (Ishii et ah, 1999, J. 
Immunology 162:1303-1309); gp96 preparations from virus-infected cells carry viral 
epitopes (Suto and Srivastava, 1995, Science 269:1585-1588; Nieland et al 9 1998, Proc. 
Natl. Acad. Sci. USA 95:1800-1805), and gp96 preparations from cells transfected with 
model antigens such as ovalbumin or P-galactosidase are associated with the corresponding 

15 epitopes (Arnold et al, 1995, J. Exp. Med. 182:885-889; Breloer et al, 1998, Eur. J. 
Immunol 28:1016-1021). The association of gp96 with peptides occurs in vivo (Menoret 
and Srivastava, 1999, Biochem. Biophys. Research Commun. 262:813-818). Gp96-peptide 
complexes, whether isolated from cells (Tamura et al, 1997, Science 278:117-120), or 
reconstituted in vitro (Blachere etal, 1997, J. Exp. Med. 186:1183-1406) are excellent 

20 immunogens and have been used extensively to elicit CD8+ T cell responses specific for the 
gp96-chaperoned antigenic peptides. 

The capacity of gp96-peptide complexes to elicit an immune response is dependent 
upon the transfer of the peptide to MHC class I molecules of antigen-presenting cells (Suto 
and Srivastava, 1995, supra). Endogenously synthesized antigens chaperoned by gp96 in 

25 the endoplasmic reticulum [ER] can prime antigen-specific CD8+ T cells (or MHC I- 

restricted CTLs) in vivo; this priming of CD8+ T cells requires macrophages. However, the 
process whereby exogenously introduced gp96-peptide complexes elicit the antigen-specific 
CD8+ T cell response is not completely understood since there is no established pathway for 
the translocation of extracellular antigens into the class I presentation machinery. Yet 

30 antigenic peptides of extracellular origin associated with HSPs are somehow salvaged by 
macrophages, channeled into the endogenous pathway, and presented by MHC I molecules 
to be recognized by CD8+ lymphocytes (Suto and Srivastava, 1995, supra; Blachere et al, 
1997 \ J. Exp. Med. 186:1315-22). 

Several models have been proposed to explain the delivery of extracellular peptides 

35 for antigen presentation. One proposal, known as the "direct transfer" model, suggests that 
HSP-chaperoned peptides are transferred to MHC I molecules on the cell surface of 
macrophages for presentation to CD8+ T lymphocytes. Another suggestion is that soluble 
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extracellular proteins can be trafficked to the cytosol via constitutive macropinocytosis in 
bone marrow-derived macrophages and dendritic cells (Norbury et aL, 1997, Eur. J. 
Immunol. 27:280-288). Yet another proposed mechanism is that HSPs are taken up by the 
MHC class I molecules of the macrophage, which stimulate the appropriate T cells 

5 (Srivastava et aL, 1994, Immunogenetics 39:93-98. Others have suggested that a novel 
intracellular trafficking pathway may be involved for the transport of peptides from the 
extracellular medium into the lumen of ER (Day et aL, 1997, Proc. Natl. Acad. Sci. 94:8064- 
8069; Nicchitta, 1998, Curr. Opin. in Immunol. 10:103-109). Further suggestions include 
the involvement of phagocytes which (a) possess an ill-defined pathway to shunt protein 

1 o from the phagosome into the cytosol where it would enter the normal class I pathway; (b) 
digest ingested material in lysosomes and regurgitate peptides for loading on the surface to 
class I molecules (Bevan, 1995, J. Exp. Med. 182:639-41). 

Still others have proposed a receptor-mediated pathway for the delivery of 
extracellular peptides to the cell surface of APCs for antigen presentation. In view of the 

1 5 extremely small quantity of gp96-chaperoned antigenic peptides required for immunization 
(Blachere et aL, 1997, supra), and the strict dependence of immunogenicity of gp96-peptide 
complexes on functional antigen presenting cells (APCs) (Udono et aL, 1994, Proc. Natl. 
Acad. Sci. U.S.A. 91:3077-3081), APCs had been proposed to possess receptors for gp96 
(Srivastava et aL, 1994, Immunogenetics 39:93-98). Preliminary microscopic evidence 

20 consistent with such receptors has been recently obtained (Binder et aL, 1998, Cell Stress & 
Chaperones 3 (Supp.l):2.; Arnold-Schild et aL, 1999, J. Immunol. 162: 3757-3760; and 
Wassenberg et aL, 1999, J. Cell Sci. 1:12). One hypothesis is that the mannose receptor is 
used in the uptake of gp96, but no mechanism has been proposed for the non-glycosylated 
HSPs, such as Hsp70 (Ciupitu et aL, 1998, J. Exp. Med., 187:685-691). 

25 The identification and characterization of specific molecules involved in HSP- 

mediated antigen presentation of peptides could provide useful reagents and techniques for 
eliciting specific immunity by HSP and HSP-peptide complexes, and for developing novel 
diagnostic and therapeutic methods. 

Citation or discussion of a reference herein shall not be construed as an admission 

3Q that such is prior art to the present invention. 

3. SUMMARY OF THE INVENTION 

The present invention relates to compositions and methods for the use of the alpha (2) 
macroglobulin ("a2M") receptor as a heat shock protein receptor. The invention is based, in 
35 part, on the Applicant's discovery that the a2M receptor is a cell surface receptor for heat 
shock proteins. In particular, the Applicant has shown that the heat shock protein gp96, 
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hsp90, hsp70, and calreticulin binds directly to the a2M receptor, and that a2M inhibits re- 
presentation of gp96, hsp90, hsp70, and calreticulin-chaperoned antigenic peptides by 
macrophages. Because no precedent exists for receptors that recognize abundant and 
intracellular proteins like HSPs, the discovery of an HSP cell surface receptor was highly 
5 unexpected. 

The present invention provides compositions comprising complexes of HSPs and the 
a2M receptor, and antibodies and other molecules that bind the HSP-a2M receptor complex. 
The invention also encompasses methods for the use of the <x2M receptor as a heat shock 
protein receptor, including methods for screening for compounds that modulate the 

10 interaction of HSP and the a2M receptor, and methods for treatment and detection of HSP- 
<x2M receptor-mediated processes and HSP-a2M receptor-related disorders and conditions, 
such as autoimmune disorders, proliferative disorders and infectious diseases. 

The invention provides a method for identifying a compound that modulates an HSP- 
<x2M receptor-mediated process, comprising: (a) contacting a test compound with a heat 

1 5 shock protein and an alpha (2) macroglobulin receptor; and (b) measuring the level of alpha 
(2) macroglobulin receptor activity or expression, such that if the level of activity or 
expression measured in (b) differs from the level of alpha (2) macroglobulin receptor activity 
in the absence of the test compound, then a compound that modulates an HSP-a2M 
receptor-mediated process is identified. In one embodiment of this method the compound 

20 identified is an antagonist which interferes with the interaction of the heat shock protein with 
the alpha (2) macroglobulin receptor, further comprising the step of: (c) determining 
whether the level interferes with the interaction of the heat shock protein and the alpha(2) 
macroglobulin receptor. In another embodiment, the test compound is an antibody specific 
for the alpha (2) macroglobulin receptor. In another embodiment, the test compound is an 

25 antibody specific for alpha (2) macroglobulin. In another embodiment, test compound is an 
antibody specific for a heat shock protein. In another embodiment, the test compound is a 
small molecule. In another yet embodiment, the test compound is a peptide. In another 
embodiment, the peptide comprises at least 5 consecutive amino acids of the alpha (2) 
macroglobulin receptor. In yet another embodiment, the peptide comprises at least 5 

30 consecutive amino acids of alpha (2) macroglobulin. In yet another embodiment, the peptide 
comprises at least 5 consecutive amino acids of a heat shock protein sequence. In another 
embodiment, the compound is an agonist which enhances the interaction of the heat shock 
protein with the alpha (2) macroglobulin receptor. In another embodiment, which the HSP- 
a2M receptor-mediated process affects an autoimmune disorder, a disease or disorder 

35 involving disruption of antigen presentation or endocytosis, a disease or disorder involving 
cytokine clearance or inflammation, a proliferative disorder, a viral disorder or other 
infectious disease, hypercholesterolemia, Alzheimer's disease, diabetes, or osteoporosis. 
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The invention also provides a method for identifying a compound that modulates an 
HSP-a2M receptor-mediated process, comprising: (a) contacting a test compound with a 
heat shock protein and an alpha (2) macroglobulin receptor-expressing cell; and (b) 
measuring the level of alpha (2) macroglobulin receptor activity or expression in the cell, 

5 such that if the level of activity or expression measured in (b) differs from the level of alpha 
(2) macroglobulin receptor activity in the absence of the test compound, then a compound 
that modulates an HSP-a2M receptor-mediated process is identified. In yet another 
embodiment, wherein the alpha (2) macroglobulin receptor activity measured is the ability to 
interact with a heat shock protein. 

lO The invention also encompasses a method for identifying a compound that modulates 

the binding of a heat shock protein to the a2M receptor, comprising: (a) contacting a heat * 
shock protein with an alpha (2) macroglobulin receptor, or fragment, or analog, derivative or 
mimetic thereof, in the presence of a test compound; and (b) measuring the amount of heat 
shock protein bound to the alpha (2) macroglobulin receptor, or fragment, analog, derivative 

15 or mimetic thereof, such that if the amount of bound heat shock protein measured in (b) i 
differs from the amount of bound heat shock protein measured in the absence of the test 
compound, then a compound that modulates the binding of an HSP to the a2M receptor is 
identified. In another embodiment, alpha (2) macroglobulin receptor contacted in step (a) is 
on a cell surface. In another embodiment, the alpha (2) macroglobulin receptor is 

20 immobilized to a solid surface. In another embodiment, the solid surface is a microtiter dish. 

In another embodiment, the amount of bound heat shock protein is measured by contacting j 
the cell with a heat shock protein-specific antibody. In yet another embodiment, the heat I 
shock protein is labeled and the amount of bound heat shock protein is measured by 
detecting the label. In another embodiment, the heat shock protein is labeled with a 

25 fluorescent label. 

The invention further provides a method for identifying a compound that modulates 
heat shock protein-mediated antigen presentation by alpha (2) macroglobulin receptor- 
expressing cells comprising: (a) adding a test compound to a mixture of alpha (2) 1 
macroglobulin receptor-expressing cells and a complex consisting essentially of a heat shock I 

30 protein noncovalently associated with an antigenic molecule, under conditions conducive to % 
alpha (2) macroglobulin receptor-mediated endocytosis; (b) measuring the level of antigen- 
specific stimulation of cytotoxic T cells by alpha (2) macroglobulin receptor-expressing 
cells, such that if the level measured in (b) differs from the level of said stimulation in the 
absence of the test compound, then a compound that modulates heat shock protein-mediated 

35 antigen presentation by alpha (2) macroglobulin receptor-expressing cells is identified. In 
one embodiment of this method, the step of measuring the level of the antigenic molecule 
presented on the cell surface of step (b) comprises: (i) adding the alpha (2) macroglobulin 
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receptor-expressing cells formed in step (a) to T cells under conditions conducive to the 
activation of the T cells; and (ii) comparing the level of activation of said cytotoxic T cells 
with the level of activation of T cells by an alpha (2) microglobulin receptor-expressing cell 
formed in the absence of the test compound, wherein an increase of decrease in level of T 

5 cell activation indicates that a compound that modulates heat shock protein-mediated antigen 
presentation by alpha (2) microglobulin receptor-expressing cells is identified. 

hi various embodiments, the heat shock protein used in the methods of the invention 
is gp96. Alternatively, the heat shock proteins hsp90, hsp70, or calreticulin may be used in 
various embodiments of the invention. 

10 In another embodiment, the invention provides a method for detecting a heat shock 

protein-alpha (2) macroglobulin receptor-related disorder in a mammal comprising 
measuring the level of an HSP-alpha (2) macroglobulin receptor-mediated process in a 
patient sample, such that if the measured level differs from the level found in clinically 
normal individuals, then a heat shock protein-alpha (2) macroglobulin receptor-related 

15 disorder is detected. 

The invention also encompasses kits comprising compositions of the invention. In 
one embodiment, a kit is provided, packaged in one or more containers, comprising: (a) a 
purified heat shock protein, nucleic acid encoding a heat shock protein, or cell expressing a 
heat shock protein; and (b) an alpha (2) macroglobulin receptor polypeptide, nucleic acid 

20 enc °ding an alpha (2) macroglobulin receptor polypeptide, or cell expressing an alpha (2) 
macroglobulin receptor polypeptide. In one embodiment, the kit the alpha (2) macroglobulin 
receptor polypeptide, nucleic acid encoding an alpha (2) macroglobulin receptor polypeptide, 
or cell expressing an alpha (2) macroglobulin receptor polypeptide is purified. In another 
embodiment, the kit further comprises instructions for use in treating an autoimmune 

25 disorder, an infectious disease^ dr i proliferative disorder. 

The invention also provides a method for modulating an immune response 
comprising administering to a mammal a purified compound that modulates the interaction 
of a heat shock protein with the alpha (2) macroglobulin receptor. In one embodiment, the 
compound is an agonist which enhances the interaction of the heat shock protein and the 

30 alpha (2) macroglobulin receptor. In another embodiment of this method the compound in 
an antagonist that interferes with the interaction between the heat shock protein and the a2M 
receptor. 

The invention further provides a method for treating an autoimmune disorder 
comprising administering to a mammal in need of such treatment a purified compound that 
35 interferes with the interaction of a heat shock protein with the alpha (2) macroglobulin 
receptor. In one embodiment of this method the compound in an antagonist that interferes 
with the interaction between the heat shock protein and the <x2M receptor. In one 
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embodiment, the antagonist is an antibody specific for alpha (2) macroglobulin receptor. In 
another embodiment, the antagonist is an antibody specific for a heat shock protein, hi 
another embodiment, the antagonist is a small molecule. In another embodiment, the 
antagonist is a peptide. In another embodiment, the peptide comprises at least 5 consecutive 
5 amino acids of alpha (2) macroglobulin receptor. In another embodiment, the peptide 
comprises at least 5 consecutive amino acids of alpha (2) macroglobulin. In another 
embodiment, the peptide comprises at least 5 consecutive amino acids of a heat shock protein 
sequence. 

The invention further provides a method for increasing the immunopotency of a 
10 cancer cell or an infected cell comprising transforming said cell with a nucleic acid 

comprising a nucleotide sequence that (i) is operably linked to a promoter, and (ii) encodes 

an alpha (2) macroglobulin receptor polypeptide. j 

Still further, the invention provides a method for increasing the immunopotency of a 
cancer cell or an infected cell comprising: (a) transforming said cell with a nucleic acid 
15 comprising a nucleotide sequence that (i) is operably linked to a promoter, and (ii) encodes 
an alpha (2) macroglobulin receptor polypeptide, and (b) administering said cell to an 
individual in need of treatment, so as to obtain an elevated immune response. 

The invention also provides a recombinant cancer cell transformed with a nucleic 
acid comprising a nucleotide sequence that (i) is operably linked to a promoter, and (ii) § 
20 encodes an alpha (2) macroglobulin receptor polypeptide. In one embodiment, the 
recombinant cell is a human cell. 

In yet another embodiment, the invention provides a recombinant infected cell j 
transformed with a nucleic acid comprising a nucleotide sequence that (i) is operably linked 
to a promoter, and (ii) encodes an alpha (2) macroglobulin receptor polypeptide. In one 
25 embodiment, the recombinant cell is a human cell. 

In another embodiment, the invention provides a method for screening for molecules 
that specifically bind to an a2M receptor comprising the steps of: (a) contacting an <x2M 
receptor with one or more test molecules under conditions conducive to binding; and 
(b) determining whether any of said test molecules specifically bind to the a2M receptor. 
30 In one embodiment of this method, test molecules are potential immunotherapeutic drugs. 

The invention also provides a method for identifying a compound that modulates the 
binding of an a2M receptor ligand to the ct2M receptor comprising: contacting an a2M 
receptor with an <x2M receptor ligand, or an a2M receptor-binding fragment, analog, 
derivative, or mimetic thereof, in the presence of one or more test compound; and (b) 
35 measuring the amount of a2M receptor ligand, or fragment, analog, derivative or mimetic 
thereof, bound to the a2M receptor, such that if the amount of bound <x2M receptor ligand 
measured in (b) differs from the amount of bound o2M measured in the absence of the test 
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compound, then a compound that modulates the binding of an a2M receptor ligand to the 
a2M receptor is identified. 

In another embodiment, a method is provided for identifying a compound that 
modulates the interaction between the <x2M receptor and an a2M receptor ligand, comprising: 

5 (a) contacting an a2M receptor with one or more test compounds; and (b) measuring the 
level of a2M receptor activity or expression, such that if the level of activity or expression 
measured in (b) differs from the level of a2M receptor activity in the absence of one or more 
test compounds, then a compound that modulates the interaction between the a2M receptor 
and an a2M receptor ligand is identified. In one embodiment, the a2M receptor ligand is 

10 a2M. 

In another embodiment, a method is provided for identifying a compound that 
modulates antigen presentation by a2M receptor-expressing cells comprising: (a) adding one 
or more test compounds to a mixture of a2M receptor-expressing cells and a complex 
comprising an a2M receptor ligand and an antigenic molecule, under conditions conducive to 

1 5 ct2M receptor-mediated endocytosis; (b) measuring the level of stimulation of antigen- 
specific cytotoxic T cells by the ct2M receptor-expressing cells, such that if the level 
measured in (b) differs from the level of said stimulation in the absence of the one or more 
test compounds, then a compound that modulates antigen presentation by a2M receptor- 
expressing cells is identified. 

20 In another embodiment, the invention provides a method for modulating an immune 

response comprising administering to a mammal a purified compound that binds to the <x2M 
receptor in an amount effective to modulate an immune response in the mammal. 

In yet another embodiment, a method for treating or preventing a disease or disorder 
is provided comprising administering to a mammal a purified compound that binds to the 

25 a2M receptor in an amount effective to treat or prevent a disease or disorder in the mammal. 
In one embodiment, the disease or disorder is cancer or an infectious disease. 

In a further embodiment, a method is provided for treating an autoimmune disorder 
comprising administering to a mammal in need of such treatment a purified compound that 
binds to the a2M receptor in an amount effective to treat an autoimmune disorder in the 

30 mammal. 

In another aspect of the invention, a method is provided for stimulating an immune 
response in a patient comprising administering to said patient blood which has been 
withdrawn from said patient and treated to remove an a2M receptor ligand. In a specific 
embodiment, the method further comprises administering to said patient a heat shock protein 
35 or a heat shock protein-antigenic peptide complex. In a specific embodiment, blood is 
administered to said patient by syringe. In another embodiment, said blood is administered 
to said patient by an intravenous drip. 
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Li another embodiment, a method is provided for stimulating an immune response in 
a patient comprising: a) removing a a2M receptor ligand from blood withdrawn from said 
patient; and b) returning at least a portion of the a2M receptor ligand-depleted blood to said 
patient 

5 In another embodiment, a method is provided for stimulating an immune response in 

a patient comprising: a) withdrawing blood from said patient; b) removing a a2M receptor 
ligand from said blood; and c) returning at least a portion of the a2M receptor ligand- 
depleted blood to said patient. In a specific embodiment, the method further comprises after 
step (a) and before step (c) the step of adding a heat shock protein or a heat shock protein 

Iq antigenic-peptide complex to said blood. In a specific embodiment, said blood is returned to 
said patient by syringe. In another specific embodiment, said blood is returned to said 
patient by an intravenous drip. In another specific embodiment, the removing a <x2M 
receptor ligand from the blood comprises the step of contacting the blood with a solid phase 
attached to a a2M receptor ligand-binding molecule for a time period and under conditions 

15 sufficient to allow binding of <x2M receptor ligand to the a2M receptor ligand-binding 
molecule solid phase. In another specific embodiment, the a2M receptor ligand-binding 
molecule is a2M receptor, or a fragment thereof. In another embodiment, said a2M receptor 
ligand-binding molecule does not bind a heat shock protein. In another embodiment, the 
a2M receptor ligand-binding molecule is an <x2M receptor ligand-specific antibody, or a 

20 fragment thereof. 

In various embodiments, an apheresis system is used in said removing step. In other 
embodiments blood is withdrawn manually in said withdrawing step. In various 
embodiments, said removing step comprises separating the plasma from said blood and 
treating said plasma to remove said a2M receptor ligand. 

25 The invention further provides a kit comprising in one or more containers a solid 

phase chromatography column with a purified a2M receptor ligand binding molecule 
attached thereto, such that withdrawn blood can be run over the column to deplete the blood 
of a a2M receptor ligand. In one embodiment, the a2M receptor ligand binding molecule of 
the kit does not bind heat shock proteins. 

30 In various embodiments, the a2M receptor ligand is <x2M, a lipoprotein complex, 

lactoferrin, tissue-type plasminogen activator, urokinase-type plasminogen activator, or an 
exotoxin. 

The term "HSP-a2M receptor-mediated process" as used herein refers to a process 
dependent and/or responsive, either directly or indirectly, to the interaction of HSP with the 
35 a2M receptor. Such processes include processes that result from an aberrant level of 

expression, synthesis and/or activity of a2M receptor, such as endocytic activities relating to 
the binding of the various <x2M ligands, including but not limited to HSP, a2M, lipoprotein 
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complexes, lactoferrin, tissue-type plasminogen activator (tPA), urokinase-type plasminogen 
activator (uP A), and exotoxins. Such processes include, but are not limited to, endocytosis, 
antigen presentation, cholesterol regulation, apoE-containing lipoprotein clearance, and 
chylomicron remnant removal. 

5 The terms "HSP-a2M receptor-related disorder" and fl HSP-a2M receptor-related 

condition", as used herein, refers to a disorder and a condition, respectively, involving a 
HSP-a2M receptor interaction. Such disorders and conditions may result, for example, from 
an aberrant ability of the a2M receptor to interact with HSP, perhaps due to aberrant levels of 
HSP and/or <x2M receptor expression, synthesis and/or activity relative to levels found in 

20 normal, unaffected, unimpaired individuals, levels found in clinically normal individuals, 
and/or levels found in a population whose levels represent a baseline, average HSP and/or 
ct2M receptor levels. Such disorders include, but are not limited to, autoimmune disorders, 
diseases and disorders involving disruption of antigen presentation and/or endocytosis, 
diseases and disorders involving cytokine clearance and/or inflammation, proliferative 

1 5 disorders, viral disorders and other infectious diseases, hypercholesterolemia, Alzheimer's 
disease, diabetes, and osteoporosis. 

The term "a2MR ligand" as used herein, refers to a molecule capable of binding to 
the a2M receptor. Such a2MR ligands include as well as known ligands, such as, but not 
limited to, cc2M and a2M complexes, heat shock proteins and heat shock protein complexes, 

20 lipoprotein complexes, lactoferrin, tissue-type plasminogen activator (tPA), urokinase-type 
plasminogen activator (uPA), and exotoxins. In addition, a2MR ligands also include 
molecules which can readily be identified as a2MR ligands using standard binding assays 
well known in the art. Such <x2MR ligands are typically endocytosed by cell upon binding to 
the a2M receptor. 

25 

4. BRIEF DESCRIPTION OF THE FIGURES 

FIG. 1 A-C . Identification of an 80 kDa polypeptide as a putative gp96 receptor. A. 
Confocal microscopy of re-presentation-competent RAW264.7 cells stained with gp96-FITC 
(left panel) and with albumin-FITC (right panel). B. SDS-PAGE analysis of detergent 
extracts of plasma membranes from surface biotinylated RAW264.7 (re-presentation- 
competent) or P815 cells (representation-incompetent) eluted from gp96 or albumin- 
Sepharose (SA) columns and stained with silver stain (top) or avidin-peroxidase (bottom). 
C gp96-S ASD-I 125 was cross-linked to live peritoneal macrophages (MO) or P8 1 5 cells, 
and the cell lysates examined by SDS-PAGE and autoradiography. Various components were 
omitted as controls, as indicated. 



-13- 



WO 01/92474 



PCT/US01/18041 



FIG, 2A-B. Anti-p80 antiserum detects an 80 kDa molecule and inhibits re-presentation of 
gp96-chaperoned AH1 peptide by macrophage. A. Pre-immune and immune sera were used 
to probe blots of plasma membrane extracts of RAW264.7, peritoneal macrophages (both 
cell types re-presentation-competent), or P815 cells. B. Re-presentation of gp96-chaperoned 
5 peptide AH1 . Sera were added at the final dilution indicated. The solid cross indicates the 
level of T cell stimulation when the APCs were pulsed directly with the AH1 peptide. The 
open cross indicates the corresponding value with unpulsed APCs. 

FIG. 3A-C. Protein microsequencing of the 80 kDa protein. A. Analysis of a single tryptic 
10 (GALHIYHQR) peptide by tandem- mass spectrometry. All possible b- and y-ion series 
together with identified b-ion series (red) and y-ion series (blue) are shown. B. Collision- 
induced dissociation (CID) spectrum of this peptide is shown. C. Four identified peptides 
from the <x2M receptor, peptide mass, and sequence are shown. 

15 FIG. 4. a2-Macroglobulin inhibits re-presentation of gp96-chaperoned AH1 peptide by 
macrophage. The solid cross indicates the level of T cell stimulation when the APCs were 
pulsed directly with the AH1 peptide. The open cross indicates the corresponding value with 
unpulsed APCs. 

20 FIG. 5. Table of specific binding of HSPs and a2-macroglobulin to primary cultures and 
cell lines of several histological origins. The "**" indicates percentage of cells staining with 
FITC over background staining alone. The "#" indicates that the cells were examined by 
confocal microscopy. All CD1 lc + cells were intensely positive for binding to the three HSPs 
and a2M.. 

25 

FIG. 6A-B. Analysis of cells by flow cytometry for the presence of FITC labelled cells. 
The macrophage cell lines RAW264.7 (A) or RAW309Cr.l (B) were incubated with 
lOOmg/ml of FITC labeled gp96, hsp90, hsp70 or SA. Live cells only were gated based on 
FSC. 

30 

FIG. 7A-B. Re-presentation of gp96-chaperoned peptides by APCs that bind HSPs and a2 
macroglobulin. The presence of IFN-y (pg/ml) was assayed as a marker for CTL stimulation. 
(A) Peritoneal macrophage or BM-DCs from C57B1/6 mice (1X104). (B) RAW 264.7 or 
RAW 309Cr.l macrophage lines were cultured with gp96 (40 mg/ml) by itself or complexed 
35 to the AH1-19 peptide and used to stimulate AH1 specific CTLs (1X104). 
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FIG, 8. Peptides chaperoned by hsp90, CRT, hsp70 and gp96 but not serum albumin are 
re-presented by RAW264.7 cells. The chaperones, uncomplexed or complexed to the 
AH1-19 peptide were used to pulse RAW264.7 cells which were tested for their ability to 
stimulate cognate CTLs. 

5 

FIG. 9A-C. Gp96, hsp90, hsp70 and calreticulin utilize a common receptor for 
re-presentation. (A) RAW264.7 cells were pulsed with gp96-AHl-19 complexes (40 mg/ml 
gp96) in presence of increasing concentrations of uncomplexed gp96, hsp90, hsp70 or SA. 
(B) Re-presentation of AH1-19 complexed to gp96, hsp90> hsp70, CRT or albumin was 
I q carried out in presence of increasing concentrations of a2-macroglobulin. The data is plotted 
as percentage inhibition of re-presentation. (C) Re-presentation of AH1-19 complexed to 
gp96, hsp90, hsp70 or calreticulin in presence of increasing concentrations of anti-CD91 
antibody. The data is plotted as percentage inhibition of re-presentation. 

^5 FIG, 10A-C. Re-presentation of gp96-chaperoned peptides follows the classical endogenous 
antigen presentation pathway. (A) Requirement of proteasomes. Peritoneal macrophage 
(1X106 ) were either treated or untreated with lactacystin (100 mM). The cells were labeled 
with chromium and used as targets against VSV8 specific CTLs. (B) Requirement of TAP as 
measured in vitro. Peritoneal macrophage from TAP+/+ or TAP-/- mice were cultured with 

2o gp96 or gp96-VSV19 complex and VSV8 specific CTL line. Culture supematants were 
tested for the presence of EFN-y (pg/ml) as a marker for CTL stimulation. (Q Requirement 
of TAP as measured in vivo. Gp96-VSV19 complex was injected intraperitoneally. After 10 
days, spleens were removed and cells were cultured in vitro with VSV8. The lymphocyte 
cultures were tested for their ability to lyse EL4 cells (dotted line) or EL4 cells pulsed with 

25 VS V8 peptide (solid line). Each line re-presents one mouse. 

FIG. 11. <x2M receptor is a sensor of necrotic cell death due to its ability to detect 
extracellular gp96. Conversely, receptors (psR) for phosphatidyl serine (ps) detect apoptotic 
cell death. 

30 

FIG. 12A. The mouse a2MR cDNA (SEQ ID NO:l) and predicted open reading frame of 
murine a2MR protein (Genbank accession no. CAA47817). B. The murine a2M protein 
(SEQ ID NO:2), with residues identified by microsequencing an 80 kDa, gp96-interacting 
fragment of the receptor highlighted in bold. 

35 

FIG. 13A. The human a2M cDNA (SEQ ID NO:3) and predicted open reading frame of 
o2M protein (SEQ ID NO:4)(Genbank accession no. Ml 1313). B. The sequence of the 
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mature human a2M protein (SEQ ID NO:5), following cleavage of the N-terminal 23 amino 
acid signal sequence. Highlighted residues represent the 138 amino acid a2MR-binding 
domain (RBD). Underlined residues represent an extension of the RBD that is present in a 
a2MR-binding, proteolytic fragment of a2M (RBDv). Bolded residues have been shown to 
5 be important for <x2MR binding. Italicized residues represent a domain that is conserved 
among ligands of <x2MR. 

BIG. 14A. The human a2MR cDNA (SEQ ID NO:6) and predicted open reading frame of 
human a2MR protein (Genbank accession no. NP_002323). B. Primary amino acid sequence 

1 0 of human a2MR (SEQ ID NO:7). The approximate locations of complement repeat clusters 
I and II are highlighted in grey. Individual complement repeats of Cl-II are indicated as 
follows: amino acids of CR3, 5, 7 and 9 are in italics, and amino acids of CR4, 6, 8, and 10 
are underlined. Amino acids highlighted in bold were present in an 80kDa peptide fragment 
of the mouse a2MR that bound to gp96. The double underlined residues represent the 

15 predicted signal peptide. For the locations of other features of the receptor, such as the EGF 
repeats, see the article by (Herz et al 9 1988, EMBO J. 7:41 19-4127). 



5. DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to compositions and methods for the use of the alpha (2) 
macroglobulin receptor (also referred to interchangeably herein as "a2MR" or "the a2M 
receptor") as a heat shock protein ( <f HSP") receptor. In particular, the present invention 
provides compositions comprising isolated a2MR- ligand complexes, e.g., a2MR-HSP 
complexes, including isolated and/or recombinant cells, and antibodies, molecules and 
compounds that modulate the interaction of a2MR with an a2MR ligand, such as HSP. The 
invention further encompasses methods for the use of a2MR as a heat shock protein receptor, 
including screening assays to identify compounds that modulate the interaction of a2MR 
with an HSP, or other <x2MR ligand, and methods for the use of these molecules and 
complexes for the diagnosis and treatment of immune disorders, proliferative disorders, and 
infectious diseases. 

The term "a2MR ligand" as used herein, refers to a molecule capable of binding to 
the a2M receptor. Such <x2MR ligands include as well as known ligands, such as, but not 
limited to, a2M and <x2M complexes, heat shock proteins and heat shock protein complexes, 
lipoprotein complexes, lactoferrin, tissue-type plasminogen activator (tPA), urokinase-type 
plasminogen activator (uPA), and exotoxins. In addition, a2MR ligands also include 
molecules which can readily be identified as a2MR ligands using standard binding assays 
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well known in the art. Such a2MR ligands are typically endocytosed by cell upon binding to 
a2MR. 

An HSP useful in the practice of the invention may be selected from among any 
cellular protein that satisfies any one of the following criteria: the intracellular concentration 

5 of an HSP increases when a cell is exposed to a stressful stimulus; an HSP can bind other 
proteins or peptides, and can release the bound proteins or peptides in the presence of 
adenosine triphosphate (ATP) or low pH; or an HSP possesses at least 35% homology with 
any cellular protein having any of the above properties. Preferably, the HSP used in the 
compositions and methods of the present invention includes, but are not limited to, HSP90, 

10 8P 96 > Bip > Hsp70, DnaK, Hsc70, PhoE calreticulin, PDI, or an sHsp, alone or in 
combination. 

In a preferred embodiment, an HSP is a mammalian {e.g., mouse, rat, primate, 
domestic animal such as dog, cat, cow, horse), and is most preferably, human. 

Hsps useful in the practice of the invention include, but are not limited to, members 
15 of the HSP60 family, HSP70 family, HSP90 family, HSP100 family, sHSP family, 

calreticulin, PDI, and other proteins in the endoplasmic reticulum that contain thioredoxin- 
like domain(s), such as, but not limited to, ERp72 and ERp61. 

HSP analogs, muteins, derivatives, and fragments can also be used in place of HSPs 
according to the invention. An HSP peptide-binding "fragment" for use in the invention 
20 refers to a polypeptide comprising a HSP peptide-binding domain that is capable of 
becoming non-covalently associated with a peptide to foim a complex that is capable of 
eliciting an immune response. In one embodiment, an HSP peptide-binding fragment is a 
polypeptide comprising an HSP peptide-binding domain of approximately 100 to 200 amino 
acids. 

25 Databases can also be searched to identify sequences with various degrees of 

similarities to a query sequence using programs, such as FASTA and BLAST, which rank the 
similar sequences by alignment scores and statistics. Such nucleotide sequences of non- 
limiting examples of HSPs that can be used for preparation of the HSPs used in the methods 
of the invention are as follows: human Hsp70, Genbank Accession No. NM_005345, 

30 Sargent et al. 9 1989, Proc. Natl. Acad. Sci. U.S.A., 86:1968-1972; human Hsp90, Genbank 
Accession No. X15183, Yamazaki etal, Nucl. Acids Res. 17:7108; human gp96: Genbank 
AccessionNo. X15187, Maki et al, 1990, Proc. Natl. Acad Sci., 87: 5658-5562; human BiP: 
Genbank Accession No. Ml 9645; Ting et aL 9 1988, DNA 7: 275-286; human Hsp27, 
Genbank Accession No. M24743; Hickey et al> 1986, Nucleic Acids Res. 14:4127-45; 

35 mouse Hsp70: Genbank Accession No. M35021, Hunt et al> 1990, Gene, 87:199-204; 
mouse gp96: Genbank Accession No. M16370, Srivastava et aL, 1987, Proc. Natl. Acad. 
Sci., 85:3807-3811; and mouse BiP: Genbank Accession No. U16277, Haas etaL, 1988, 
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Proc. Natl. Acad. Sci. U.S.A., 85: 2250-2254. Due to the degeneracy of the genetic code, the 
term "HSP sequence", as used herein, refers not only to the naturally occurring amino acid 
and nucleotide sequence but also encompasses all the other degenerate sequences that encode 
the HSP. 

5 The aforementioned HSP families also contain proteins that are related to HSPs in 

sequence, for example, having greater than 35% amino acid identity, but whose expression 
levels are not altered by stress. Therefore, it is contemplated that the definition of heat shock 
or stress protein, as used herein, embraces other proteins, mutants, analogs, and variants 
thereof having at least 35% to 55%, preferably 55% to 75%, and most preferably 75% to 

1 o 85% amino acid identity with members of these families whose expression levels in a cell are 
enhanced in response to a stressful stimulus. The determination of percent identity between 
two sequences can also be accomplished using a mathematical algorithm. A preferred, 
non-limiting example of a mathematical algorithm utilized for the comparison of two 
sequences is the algorithm of Karlin and Altschul, 1990, Proc. Natl. Acad Sci. USA 

15 87:2264-2268, modified as in Karlin and Altschul, 1993, Proc. Natl. Acad. Sci. USA 

90:5873-5877. Such an algorithm is incorporated into the NBLAST and XBLAST programs 
of Altschul, et al., 1990, J. Mol. Biol. 215:403-410. BLAST nucleotide searches can be 
performed with the NBLAST program, score = 100, wordlength = 12 to obtain nucleotide 
sequences homologous to a nucleic acid molecules of the invention. BLAST protein 

20 searches can be performed with the XBLAST program, score = 50, wordlength = 3 to obtain 
amino acid sequences homologous to a protein molecules of the invention. To obtain gapped 
alignments for comparison purposes, Gapped BLAST can be utilized as described in 
Altschul et al 9 1997, Nucleic Acids Res.25:3389-3402. Alternatively, PSI-Blast can be used 
to perform an iterated search which detects distant relationships between molecules (Altschul 

25 et al, 1997, supra). When utilizing BLAST, Gapped BLAST, and PSI-Blast programs, the 
default parameters of the respective programs (e.g-., XBLAST and NBLAST) can be used 
(see http://www,ncbi.nlm.nih.gov). Another preferred, non-limiting example of a 
mathematical algorithm utilized for the comparison of sequences is the algorithm of Myers 
and Miller, 1988, CABIOS 4:1 1-17. Such an algorithm is incorporated into the ALIGN 

30 program (version 2.0) which is part of the GCG sequence alignment software package. 

When utilizing the ALIGN program for comparing amino acid sequences, a PAM120 weight 
residue table, a gap length penalty of 12, and a gap penalty of 4 can be used. 

The immunogenic HSP-peptide complexes of the invention may include any complex 
containing an HSP and a peptide that is capable of inducing an immune response in a 

35 mammal. The peptides are preferably noncovalently associated with the HSP. Preferred 
complexes may include, but are not limited to, gp96-peptide complexes, HSP90-peptide 
complexes, HSP70-peptide complexes, HSP60-peptide complexes, HSPlOO-peptide 
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complexes, calreticulin-peptide complexes, and sHSP-peptide complexes. For example, the 
HSP gp96 which is present in the endoplasmic reticulum of eukaryotic cells and is related to 
the cytoplasmic HSP90 f s can be used to generate an effective vaccine containing a gp96- 
peptide complex. 

5 The HSPs, a2MR, and/or antigenic molecules for use in the invention can be purified 

from natural sources, chemically synthesized, or recombinantly produced. Although the 
HSPs may be allogeneic to the patient, in a preferred embodiment, the HSPs are autologous 
to the patient to whom they are administered. 

10 5.1 COMPOSITIONS OF THE INVENTION 

The present invention provides compositions that modulate the interaction between 
<x2MR and an <x2MR ligand, such as, for example, an HSP. Such compositions can be used 
in methods to elicit or modulate an immune response. Such compositions also include 
antibodies that specifically recognize HSP- <x2MR complexes, isolated cells that express 

15 HSP-a2MR complexes, and isolated and recombinant cells that contain recombinant a2MR 
and HSP sequences. In addition, in various methods of the invention, sequences encoding 
ct2MR, an HSP, and <x2M are used for immunotherapy. Such compositions can be used, for 
example, in immunotherapy against proliferative disorders, infectious diseases, and other 
HSP-a2MR-related disorders. Methods for the synthesis and production of such 

20 compositions are described herein. 

5.1.1 RECOMBINANT EXPRESSION 

In various embodiments of the invention, sequences encoding the a2MR, an HSP, 
a2M, or other a2MR ligand are inserted into an expression vector for propagation and 
25 expression in recombinant cells. Thus, in one embodiment, the a2M receptor, HSP, a2M, or 
other a2MR ligand coding region is linked to a non-native promoter for expression in 
recombinant cells. 

The amino acid sequence of the portion of <x2MR that recognizes and binds to HSPs 
is shown in FIG. 12B (SEQ ED NO:2). Based on the discovery by the Applicant, this 

30 portion of a2MR is responsible for recognizing and binding to HSPs and HSP-antigenic 
peptide complexes. After binding HSPs, a2MR facilitates transport of the HSP-antigenic 
peptide complex into the cell, where the peptide antigens associate with MHC class I 
molecules and are then presented on the cell surface of the cell, and become available to 
stimulate an immune response. Based on this invention, compositions comprising agonists 

35 and antagonists of <x2MR and HSPs interactions can be used to modulate the immune 

response. Thus, recombinant a2MR polypeptides, complexes of a2MR and an HSP or HSP- 
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antigenic peptide complexes, and recombinant cells expressing <x2MR or complexes 
comprising a2MR and antigenic peptides can be used in methods for immunotherapy and 
diagnostic methods described herein, 

In various embodiments of the invention, sequences encoding the a2MR, and/or a 

5 heat shock protein or a2M, or fragments thereof, are inserted into an expression vector for 
propagation and expression in recombinant cells. An expression construct, as used herein, 
refers to a nucleotide sequence encoding a particular gene product, such as the a2MR, HSP 
or a2M, operably associated with one or more regulatory regions which allows expression of 
the encoded gene product in an appropriate host cell. "Operably-associated" refers to an 

10 association in which the regulatory regions and the nucleotide sequence encoding the gene 
product to be expressed are joined and positioned in such a way as to permit transcription, 
and ultimately, translation. 

The DNA may be obtained from known sequences derived from sequence databases 
by standard procedures known in the art by DNA amplification or molecular cloning directly 

15 from a tissue, cell culture, or cloned DNA (e.g., a DNA "library"). Any eukaryotic cell may 
serve as the nucleic acid source for obtaining the coding region of an hsp gene. Nucleic acid 
sequences encoding HSPs can be isolated from vertebrate, mammalian, as well as primate 
sources, including humans. Clones derived from genomic DNA may contain regulatory and 
intron DNA regions in addition to coding regions; clones derived from cDNA will contain 

20 only exon sequences. Whatever the source, the hsp gene should be cloned into a suitable 
vector for propagation of the gene. 

Vectors based on E: coli are the most popular and versatile systems for high level 
expression of foreign proteins (Makrides, 1996, Microbiol Rev, 60:512-538). Non-limiting 
examples of regulatory regions that can be used for expression in E. coli may include but not 

25 limited to lac, trp, Ipp, phoA, recA, tac, XP L and phage T3 and T7 promoters (Makrides, 
1996, Microbiol Rev, 60:512-538). Non-limiting examples of prokaryotic expression vectors 
may include the Xgt vector series such as Xgtl 1 (Huynh et al., 1984 in "DNA Cloning 
Techniques", Vol. I: A Practical Approach (D. Glover, ed.), pp. 49-78, IRL Press, Oxford), 
and the pET vector series (Studier et al., 1990, Methods Enzymol., 185:60-89). However, a 

30 potential drawback of a prokaryotic host-vector system is the inability to perform many of 
the post-translational processing events of mammalian cells. Thus, an eukaryotic host-vector 
system is preferred, a mammalian host-vector system is more preferred, and a human host- 
vector system is the most preferred. 

The regulatory regions necessary for transcription of an a2MR sequence, for 

35 example, can be provided by the expression vector. A translation initiation codon (ATG) 
may also be provided to express a nucleotide sequence encoding an a2M receptor that lacks 
an initiation codon. In a compatible host-construct system, cellular proteins required for 
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transcription, such as RNA polymerase and transcription factors, will bind to the regulatory 
regions on the expression construct to effect transcription of the a2MR sequence in the host 
organism. The precise nature of the regulatory regions needed for gene expression may vary 
from host cell to host cell. Generally, a promoter is required which is capable of binding 

5 RNA polymerase to initiate the transcription of an operably-associated nucleic acid 

sequence. Such regulatory regions may include those 5 f -non-coding sequences involved with 
initiation of transcription and translation, such as the TATA box, the cap site, a CAAT box, 
and the like. The non-coding region 3' to the coding sequence may contain transcriptional 
termination regulatory sequences, such as terminators and polyadenylation sites. 

10 Both constitutive and inducible regulatory regions may be used for expression of the 

a2M receptor, HSP, <x2M, or other <x2MR ligand. It may be desirable to use inducible 
promoters when the conditions optimal for growth of the recombinant cells and the 
conditions for high level expression of the gene product are different. Examples of useful 
regulatory regions are provided in the next section below. 

1 5 For expression of the a2M receptor, HSP, a2M, or other a2MR ligand gene product 

in mammalian host cells, a variety of regulatory regions can be used, for example, the SV40 
early and late promoters, the cytomegalovirus (CMV) immediate early promoter, and the 
Rous sarcoma virus long terminal repeat (RSV-LTR) promoter. Inducible promoters that 
may be useful in mammalian cells include but are not limited to those associated with the 

20 metallothionein II gene, mouse mammary tumor virus glucocorticoid responsive long 

terminal repeats (MMTV-LTR), the P-interferon gene, and the Hsp70 gene (Williams et al, 
1989, Cancer Res. 49:2735-42; Taylor et al. 9 1990, Mol. Cell Biol, 10:165-75). It may be 
advantageous to use heat shock promoters or stress promoters to drive expression of a2MR 
in recombinant host cells. 

25 The following animal regulatory regions, which exhibit tissue specificity and have 

been utilized in transgenic animals, can also be used in tumor cells of a particular tissue type: 
elastase I gene control region which is active in pancreatic acinar cells (Swift et al., 1984, 
Cell 38:639-646; Ornitz et al., 1986, Cold Spring Harbor Symp. Quant. Biol. 50:399-409; 
MacDonald, 1987, Hepatology 7:425-515); insulin gene control region which is active in 

30 pancreatic beta cells (Hanahan, 1985, Nature 315:11 5-122), immunoglobulin gene control 
region which is active in lymphoid cells (Grosschedl et al., 1984, Cell 38:647-658; Adames 
et al., 1985, Nature 318:533-538; Alexander et al., 1987, Mol. Cell. Biol. 7:1436-1444), 
mouse mammary tumor virus control region which is active in testicular, breast, lymphoid 
and mast cells (Leder et al., 1986, Cell 45:485-495), albumin gene control region which is 

35 active in liver (Pinkert et al, 1987, Genes and DeveL 1 :268-276), alpha-fetoprotein gene 
control region which is active in liver (Krumlauf et al., 1985, Mol. Cell. Biol. 5:1639-1648; 
Hammer et al., 1987, Science 235:53-58; alpha 1-antitrypsin gene control region which is 
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active in the liver (Kelsey et al., 1987, Genes and Devel 1:161-171), beta-globin gene 
control region which is active in myeloid cells (Mogram et al., 1985, Nature 315:338-340; 
Kollias et al., 1986, Cell 46:89-94; myelin basic protein gene control region which is active 
in oligodendrocyte cells in the brain (Readhead et al., 1987, Cell 48:703-712); myosin light 

5 chain-2 gene control region which is active in skeletal muscle (Sani, 1 985, Nature 3 14:283- 
286), and gonadotropic releasing hormone gene control region which is active in the 
hypothalamus (Mason et al., 1986, Science 234:1372-1378). 

The efficiency of expression of the a2M receptor in a host cell may be enhanced by 
the inclusion of appropriate transcription enhancer elements in the expression vector, such as 

10 & ose found in SV40 virus, Hepatitis B virus, cytomegalovirus, immunoglobulin genes, 
metallothionein, P-actin (see Bittner et al., 1987, Methods in Enzymol. 153:516-544; 
Gorman, 1990, Curr. Op. in Biotechnol. 1:36-47). 

The expression vector may also contain sequences that permit maintenance and 
replication of the vector in more than one type of host cell, or integration of the vector into 

15 the host chromosome. Such sequences may include but are not limited to replication origins, 
autonomously replicating sequences (ARS), centromere DNA, and telomere DNA. It may 
also be advantageous to use shuttle vectors that can be replicated and maintained in at least 
two types of host cells. 

In addition, the expression vector may contain selectable or screenable marker genes 

20 for initially isolating or identifying host cells that contain DNA encoding an <x2M receptor. 
For long term, high yield production of a2M receptor, stable expression in mammalian cells 
is preferred. A number of selection systems may be used for mammalian cells, including, but 
not limited, to the Herpes simplex virus thymidine kinase (Wigler et al., 1977, Cell 1 1 :223), 
hypoxanthine-guanine phosphoribosyltransferase (Szybalski and Szybalski, 1962, Proc. Natl. 

25 Acad. Sci. USA 48:2026), and adenine phosphoribosyltransferase (Lowy et al., 1980, Cell 
22:817) genes can be employed in tk\ hgprf or aprf cells, respectively. Also, antimetabolite 
resistance can be used as the basis of selection for dihydrofolate reductase {dhfr\ which 
confers resistance to methotrexate (Wigler et al., 1980, Natl. Acad. Sci. USA 77:3567; 
O'Hare et al., 1981, Proc. Natl. Acad. Sci. USA 78:1527); gpt, which confers resistance to 

30 mycophenolic acid (Mulligan & Berg, 1981, Proc. Natl. Acad. Sci. USA 78:2072); neomycin 
phosphotransferase (neo), which confers resistance to the aminoglycoside G-418 (Colbeire- 
Garapin et al., 1981, J. Mol. Biol. 150:1); and hygromycin phosphotransferase (hyg), which 
confers resistance to hygromycin (Santerre et al., 1984, Gene 30:147). Other selectable 
markers, such as but not limited to histidinol and Zeocin™ can also be used. 

35 In order to insert the DNA sequence encoding a2M receptor, HSP, a2M, or other 

a2MR ligand into the cloning site of a vector, DNA sequences with regulatory functions, 
such as promoters, must be attached to DNA sequences encoding the <x2M receptor, HSP, 
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a2M, or other a2MR ligand, respectively. To do this, linkers or adapters providing the 
appropriate compatible restriction sites may be ligated to the ends of cDNA or synthetic 
DNA encoding an a2M receptor, by techniques well known in the art (Wu et al, 1987, 
Methods in Enzymol 152:343-349). Cleavage with a restriction enzyme can be followed by 
5 modification to create blunt ends by digesting back or filling in single-stranded DNA termini 
before ligation. Alternatively, a desired restriction enzyme site can be introduced into a 
fragment of DNA by amplification of the DNA by use of PCR with primers containing the 
desired restriction enzyme site. 



10 operably associated with regulatory regions can be directly introduced into appropriate host 
cells for expression and production of a2MR without further cloning (see, for example, U.S. 
Patent No. 5,580,859). The expression constructs may also contain DNA sequences that 
facilitate integration of the <x2M receptor sequence into the genome of the host cell, e.g., via 
homologous recombination. In this instance, it is not necessary to employ an expression 

1 5 vector comprising a replication origin suitable for appropriate host cells in order to propagate 
and express the <x2M receptor in the host cells. 

Expression constructs containing cloned nucleotide sequence encoding the a2M 
receptor, an HSP, a2M, or other a2MR ligand, can be introduced into the host ceU by a 
variety of techniques known in the art, including but not limited to, for prokaryotic cells, 

20 bacterial transformation (Hanahan, 1985, in DNA Cloning, A Practical Approach, 1:109- 
136), and for eukaryotic cells, calcium phosphate mediated transfection (Wigler et al., 1977, 
Cell 11:223-232), liposome-mediated transfection (Schaefer-Ridder et al., 1982, Science 
215:166-168), electroporation (Wolff et al., 1987, Proc Natl Acad Sci 84:3344), and 
microinjection (Cappechi, 1980, Cell 22:479-488). 

25 For long term, high yield production of properly processed a2M receptor, HSP, a2M, 

or other a2MR ligand, stable expression in mammalian cells is preferred. Cell lines that 
stably express the a2M receptor, HSP, a2M, or other a2MR ligand or a2MR-peptide 
complexes may be engineered by using a vector that contains a selectable marker. By way of 
example but not limitation, following the introduction of the expression constructs, 

30 engineered cells may be allowed to grow for 1-2 days in an enriched media, and then are 
switched to a selective media. The selectable marker in the expression construct confers 
resistance to the selection and optimally allows cells to stably integrate the expression 
construct into their chromosomes and to grow in culture and to be expanded into cell lines. 
Such cells can be cultured for a long period of time while the desired gene product is 

35 expressed continuously. 

The recombinant cells may be cultured under standard conditions of temperature, 
incubation time, optical density, and media composition. Alternatively, recombinant 



In one embodiment, an expression construct comprising an a2M receptor sequence 
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antigenic cells may be cultured under conditions emulating the nutritional and physiological 
requirements of the cancer cell or infected cell. However, conditions for growth of 
recombinant cells may be different from those for expression of the a2M receptor, HSPs, 
a2M, or other a2MR ligand, or antigenic peptide. 

5 

5.1.2 PEPTIDE SYNTHESIS 

An alternative to producing peptides and polypeptides comprising HSP, <x2M 
receptor, a2M or other a2MR ligand sequences, by recombinant techniques is peptide 
synthesis. For example, a peptide corresponding to a portion of an HSP or an <x2M peptide 

10 comprising the receptor-binding domain, which can be used as an antagonist in the 

therapeutic methods described herein, can be synthesized by use of a peptide synthesizer. 
Synthetic peptides corresponding to a2M receptor sequences useful for therapeutic methods 
described herein can also be produced synthetically. Conventional peptide synthesis may be 
used or other synthetic protocols well known in the art. 

15 For example, peptides having the amino acid sequence of the <x2M receptor, an HSP, 

a2M, or other <x2MR ligand, or an analog, mutein, fragment, or derivative thereof, may be 
synthesized by solid-phase peptide synthesis using procedures similar to those described by 
Merrifield, 1963, J. Am. Chem. Soc, 85:2149. During synthesis, N-a-protected amino acids 
having protected side chains are added stepwise to a growing polypeptide chain linked by its 

20 C-terminal and to an insoluble polymeric support i.e., polystyrene beads. The peptides are 
synthesized by linking an amino group of an N-a-deprotected amino acid to an a-carboxyl 
group of an N-a-protected amino acid that has been activated by reacting it with a reagent 
such as dicyclohexylcarbodiimide. The attachment of a free amino group to the activated 
carboxyl leads to peptide bond formation. The most commonly used N-cc-protecting groups 

25 include Boc which is acid labile and Fmoc which is base labile. Details of appropriate 
chemistries, resins, protecting groups, protected amino acids and reagents are well known in 
the art and so are not discussed in detail herein (See, Atherton, et al 9 1989, Solid Phase 
Peptide Synthesis: A Practical Approach, IRL Press, and Bodanszky, 1993, Peptide 
Chemistry, A Practical Textbook, 2nd Ed., Springer- Verlag). 

5° Purification of the resulting a2M receptor, HSP, a2M, or other a2MR ligand peptides 

is accomplished using conventional procedures, such as preparative HPLC using gel 
permeation, partition and/or ion exchange chromatography. The choice of appropriate 
matrices and buffers are well known in the art and so are not described in detail herein. 
In addition, analogs and derivatives of a2M receptor, HSP, a2M, or other a2MR 

55 ligand protein can be chemically synthesized. Furthermore, if desired, nonclassical amino 
acids or chemical amino acid analogs can be introduced as a substitution or addition into the 
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ct2M receptor, HSP, a2M, or other a2MR ligand sequence. Non-classical amino acids 
include but are not limited to the D-isomers of the common amino acids, a-amino isobutyric 
acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, y-Abu, 8-Ahx, 6-amino hexanoic acid, 
Aib, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, norleucine, norvaline, 
5 hydroxyproline, sarcosine, citrulline, cysteic acid, t-butylglycine, t-butylalanine, 

phenylglycine, cyclohexylalanine, 0-alanine, fluoro-amino acids, designer amino acids such 
as P-methyl amino acids, Ca-methyl amino acids, Na-methyl amino acids, and amino acid 
analogs in general. 

10 5.1.3 ANTIBODIES SPECIFIC FOR a2M RECEPTOR-HSP COMPLEXES 

Described herein are methods for the production of antibodies capable of specifically 
recognizing <x2M receptor epitopes, HSP-a2M receptor complex epitopes or epitopes of 
conserved variants or peptide fragments of the receptor or receptor complexes. Such 
antibodies are useful for therapeutic and diagnostic methods of the invention. 

15 Such antibodies may include* but are not limited to, polyclonal antibodies, 

monoclonal antibodies (mAbs), humanized or chimeric antibodies, single chain antibodies, 
Fab fragments, F(ab') 2 fragments, fragments produced by a Fab expression library, anti- 
idiotype (anti-Id) antibodies, and epitope-binding fragments of any of the above. Such 
antibodies may be used, for example, in the detection of an <x2M receptor or HSP-a2M 

20 receptor complex in an biological sample. Such antibodies may also be utilized in 

conjunction with, for example, compound screening schemes, as described below, in Section 
5.2, for the evaluation of the effect of test compounds on the interaction between HSPs and 
the a2M receptor. 

Anti-a2M receptor complex antibodies may additionally be used as a method for the 
25 inhibition of abnormal receptor product activity. Thus, such antibodies may, be utilized as 
part of treatment methods for HSP-a2M receptor related disorders, e.g., autoimmune 
disorders. 

For the production of antibodies against a2M receptor or receptor complexes, various 
host animals may be immunized by injection with an a2M receptor or HSP-a2M receptor 
30 complex, or a portion thereof. An antigenic portion of <x2M receptor or HSP-a2M receptor 
complex can be readily predicted by algorithms known in the art. 

Host animals may include, but are not limited to rabbits, mice, and rats, to name but a 
few. Various adjuvants may be used to increase the immunological response, depending on 
the host species, including but not limited to Freund's (complete and incomplete), mineral 
35 gels such as aluminum hydroxide, surface active substances such as lysolecithin, pluronic 
polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, dinitrophenol, and 
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potentially useful human adjuvants such as BCG (bacille Calmette-Guerin) and 
Corynebacterium parvum. 

Polyclonal antibodies are heterogeneous populations of antibody molecules derived 
from the sera of animals immunized with an antigen, such as an a2M receptor or HSP-a2M 

5 receptor complex, or an antigenic functional derivative thereof. For the production of 
polyclonal antibodies, host animals such as those described above, may be immunized by 
injection with <x2M receptor or H3P-a2M receptor complex, or portion thereof, supplemented 
with adjuvants as also described above. 

Monoclonal antibodies, which are homogeneous populations of antibodies to a 

10 particular antigen, may be obtained by any technique that provides for the production of 
antibody molecules by continuous cell lines in culture. These include, but are not limited to, 
the hybridoma technique of Kohler and Milstein, (1975, Nature 256, 495-497; and U.S. 
Patent No. 4,376,1 10), the human B-cell hybridoma technique (Kosbor et al, 1983, 
Immunology Today 4: 72; Cole et al, 1983, Proc. Natl. Acad. Sci. USA 80, 2026-2030), and 

1 5 the EB V-hybridoma technique (Cole et al , 1985, Monoclonal Antibodies And Cancer 
Therapy, Alan R. Liss, Inc., pp. 77-96). Such antibodies may be of any immunoglobulin 
class including IgG, IgM, IgE, IgA, IgD and any subclass thereof. The hybridoma producing 
the mAb of this invention may be cultivated in vitro or in vivo. Production of high titers of 
mAbs in vivo makes this the presently preferred method of production. 

20 In addition, techniques developed for the production of "chimeric antibodies" 

(Morrison, et al, 1984, Proc. Natl. Acad. Sci., 81: 6851-6855; Neuberger, et al, 1984, 
Nature 312: 604-608; Takeda, et al, 1985, Nature, 314: 452-454) by splicing the genes from 
a mouse antibody molecule of appropriate antigen specificity together with genes from a 
human antibody molecule of appropriate biological activity can be used. A chimeric 

25 antibody is a molecule in which different portions are derived from different animal species, 
such as those having a variable region derived from a murine mAb and a human 
immunoglobulin constant region (see, e.g., Cabilly et al, U.S. Patent No. 4,816,567; and 
Boss et al, U.S. Patent No. 4,816397, which are incorporated herein by reference in their 
entirety). 

30 In an additional embodiment of the invention, monoclonal antibodies can be 

produced in germ-free animals (see PCT International Publication No. WO 89/12690, 
published December 12, 1989). According to the invention, human antibodies may be used 
and can be obtained by using human hybridomas (Cote et al, 1983, Proc. Natl. Acad. Sci. 
U.S.A. 80:2026-2030) or by transforming human B cells with EBV virus in vitro (Cole et al, 

35 1985, in Monoclonal Antibodies a nd Cancer Therapy. Alan R. Liss, pp. 77-96). Techniques 
developed for the production of "chimeric antibodies" (Morrison et al, 1984, Proc. Natl. 
Acad. Sci. U.S.A. 81:6851-6855; Neuberger et al, 1984, Nature 312:604-608; Takeda et al, 

-26- 



WO 01/92474 



PCT/US01/18041 



1 985, Nature 3 14:452-454) by splicing the genes from a mouse antibody molecule specific 
for an a2M receptor-HSP complex together with genes from a human antibody molecule of 
appropriate biological activity can also be used; such antibodies are within the scope of this 
invention. 

5 Humanized antibodies are also provided (see U.S. Patent No. 5,225,539 by Winter). 

An iminunoglobuin light or heavy chain variable region consists of a "framework" region 
interrupted by three hypervariable regions, referred to as complementarity determining 
regions (CDRs). The extent of the framework region and CDRs have been precisely defined 
(see, "Sequences of Proteins of Immunological Interest", Kabat, E. et a/., U.S. Department of 

10 Health and Human Services (1983). Briefly, humanized antibodies are antibody molecules 
from non-human species having one or more CDRs from the non-human species and a 
framework region from a human immunoglobulin molecule. Such CDRS-grafted antibodies 
have been successfully constructed against various antigens, for example, antibodies against 
IL-2 receptor as described in Queen et aL, 1989, Proc. Natl. Acad. Sci. USA 86:10029; 

15 antibodies against the cell surface receptor CAMPATH as described in Riechmann et aL, 
1988, Nature 332:323; antibodies against hepatitis B in Co et aL, 1991, Proc. Natl. Acad. Sci. 
USA 88:2869; as well as against viral antigens of the respiratory syncytial virus in Tempest 
et aL 9 1991, Bio-Technology 9:267. Humanized antibodies are most preferred for 
therapeutic use in humans. 

20 Alternatively, techniques described for the production of single chain antibodies 

(U.S. Patent 4,946,778; Bird, 1988, Science 242: 423-426; Huston et aL, 1988, Proc. Natl. 
Acad. Sci. USA 85: 5879-5883; and Ward et aL, 1989, Nature 334: 544-546) can be adapted 
to produce single chain antibodies against a2M receptor or HSP-a2M receptor complexes, or 
portions thereof. Single chain antibodies are formed by linking the heavy and light chain 

25 fragments of the Fv region via an amino acid bridge, resulting in a single chain polypeptide. 
Antibody fragments that recognize specific epitopes may be generated by known 
techniques. For example, such fragments include but are not limited to: the F(ab , ) 2 
fragments, which can be produced by pepsin digestion of the antibody molecule and the Fab 
fragments, which can be generated by reducing the disulfide bridges of the F(ab f ) 2 fragments. 

3 q Alternatively, Fab expression libraries may be constructed (Huse et aL, 1989, Science, 246: 
1275-1281) to allow rapid and easy identification of monoclonal Fab fragments with the 
desired specificity. 

Antibodies to the <x2M receptor can, in turn, be utilized to generate anti-idiotype 
antibodies that "mimic" the a2M receptor, using techniques well known to those skilled in 
35 the art (see, e.g. 9 Greenspan & Bona, 1993, FASEB J 7(5):437-444; and Nissinoff, 1991, J. 
Immunol. 147(8):2429-2438). For example antibodies which bind to the a2M receptor ECD 
and competitively inhibit the binding of HSPs to the a2M receptor can be used to generate 
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anti-idiotypes that "mimic" the ECD and, therefore, bind and neutralize HSPs. Such 
neutralizing anti-idiotypes or Fab fragments of such anti-idiotypes can be used in therapeutic 
regimens to neutralize the native ligand and treat HSP-a2M receptor-related disorders, such 
as immunological disorders, proliferative disorders, and infectious diseases. 

5 Alternatively, antibodies to the a2M receptor that can act as agonists of the <x2M 

receptor activity can be generated. Such antibodies will bind to the a2M receptor and 
activate the signal transducing activity of the receptor. In addition, antibodies that act as 
antagonist of the <x2M receptor activity, i.e. inhibit the activation of the <x2M receptor would 
be particularly useful for treating autoimmune disorders, proliferative disorders, such as 

IQ cancer, and infectious diseases. Methods for assaying for such agonists and antagonists are 
described in detail in Section 5.2, below. 



15 



20 



25 



30 



35 



5.2 ASSAYS FOR THE IDENTIFICATION OF COMPOUNDS THAT INTERACT 
WITH THE a2M RECEPTOR 

The present invention is based on the discovery that the a2M receptor recognizes 
HSP-antigenic peptide complexes and transports them within the cell for the purpose of 
presenting such antigenic molecules to cells of the immune system and eliciting an immu ne 
response. Thus, methods for identifying compounds that interact with the receptor, or 
enhance or block the function of the receptor, are included in the invention. The present 
invention provides in vitro and in vivo assay systems, described in the subsections below, 
which can be used to identify compounds or compositions that interact with the a2M 
receptor, or modulate the activity of the a2M receptor and its interaction with HSPs or HSP- 
peptide complexes. 

The invention provides screening methodologies useful in the identification of small 
molecules, proteins and other compounds which interact with the ot2M receptor, or modulate 
the interaction of HSPs with the a2M receptor. Such compounds may bind Hie a2M 
receptor genes or gene products with differing affinities, and may serve as regulators of 
receptor activity in vivo with useful therapeutic applications in modulating the immune 
response. For example, certain compounds that inhibit receptor function may be used in 
patients to downregulate destructive immune responses which are caused by cellular release 
ofHSPs. 

Methods to screen potential agents for their ability to interact with the <x2M receptor, 
or modulate <x2M receptor expression and activity can be designed based on the inventor's 
discovery of the receptor and its role in HSP or HSP-peptide complex binding and 
recognition. <x2M receptor protein, nucleic acids, and derivatives can be used in screening 
assays to detect molecules that specifically bind to HSP proteins, derivatives, or nucleic 
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acids, and thus have potential use as agonists or antagonists of the <x2M receptor, to modulate 
the immune response. In a preferred embodiment, such assays are performed to screen for 
molecules with potential utility as anti-autoimmune disease, anti-cancer and anti-infective 
drugs (such as anti-viral drugs and antibiotic drugs), or lead compounds for drug 

5 development For example, recombinant cells expressing a2M receptor nucleic acids can be 
used to recombinantly produce a2M receptor in these assays, to screen for molecules that 
interfere with the binding of HSPs to the a2M receptor. Similar methods can be used to 
screen for molecules that bind to the a2M receptor derivatives or nucleic acids. Methods that 
can be used to carry out the foregoing are commonly known in the art. 

10 Compounds capable of specifically binding the a2M receptor can be useful for 

immunotherapy. In one embodiment, an assay is disclosed for identifying compounds that 
specifically bind the <x2M receptor comprising: (a) contacting an a2M receptor with one or 
more test compounds under conditions conducive to binding; and (b) identifying one or more 
test compounds which specifically bind to the a2M receptor, such that a compound capable 

15 of specifically binding the a2M receptor is identified as a compound useful for 
immunotherapy. 

Another method encompassed by the invention for identifying a compound useful for 
immunotherapy involves identifying a compound which modulates the binding of an <x2M 
receptor ligand to the a2M receptor. The term "a2M receptor ligand" as used herein, refers 

20 to an molecule capable of binding to the a2M receptor. Such a2M receptor ligands include, 
but are not limited to, a2M and a2M complexes, heat shock proteins and heat shock protein 
complexes, lipoprotein complexes, lactoferrin, tissue-type plasminogen activator (tPA), 
urokinase-type plasminogen activator (uPA), and exotoxins. Such ligands are typically 
endocytosed by cell upon binding to the o2M receptor. The method comprises the steps of: 

25 (a) contacting an a2M receptor with an a2M receptor ligand, or fragment, or analog, 
derivative or mimetic thereof, in the presence of one or more test compound; and (b) 
measuring the amount of a2M receptor ligand, or fragment, analog, derivative or mimetic 
thereof, bound to the <x2M receptor, such that if the amount of bound a2M receptor ligand. 
measured in (b) differs from the amount of bound a2M receptor measured in the absence of 

30 the test compound, then a compound useful for immunotherapy that modulates the binding of 
an a2M receptor ligand to the a2M receptor is identified. 

In another embodiment, a method for identifying a compound useful for 
immunotherapy which modulates the interaction between the <x2M receptor and an a2M 
receptor ligand is provided by the invention. This method comprises the steps of: (a) 

3 5 contacting an a2M receptor with one or more test compounds; and (b) measuring the level of 
a2M receptor activity or expression, such that if the level of activity or expression measured 
in (b) differs from the level of a2M receptor activity in the absence of one or more test 
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compounds, then a compound that modulates the interaction between the a2M receptor and 
an ct2M receptor ligand is identified. 

In another embodiment, an assay for identifying a compound that modulates an HSP- 
a2M receptor-mediated process is disclosed. This assay comprises: (a) contacting a test 

5 compound with an HSP and an a2M receptor; and (b) measuring the level of a2M receptor 
activity or expression, such that if the level of activity or expression measured in (b) differs 
from the level of a2M receptor activity in the absence of the test compound, then a 
compound that modulates an HSP-a2M receptor-mediated process is identified. In another 
embodiment, in which the compound identified is an antagonist which interferes with the 

10 interaction of the HSP with the a2M receptor, the method further comprises the step of 
determining whether the level interferes with the interaction of the HSP kid the a2M 
receptor. 

In another embodiment, a cell-based method for identifying a compound that 
modulates an HSP-a2M receptor-mediated process is described, this method comprises the 

1 5 following steps: (a) contacting a test compound with a heat shock protein and an a2M 
receptor-expressing cell; and (b) measuring the level of a2M receptor activity or expression 
in the cell, such that if the level of activity or expression measured in (b) differs from the 
level of a2M receptor activity in the absence of the test compound, then a compound that 
modulates an HSP-a2M receptor-mediated process is identified. 

20 hi another embodiment* a receptor-ligand binding assay for identifying a compound 

that interacts with a2MR, or modulates the binding of an HSP to a2MR. One such method 
comprises: (a) contacting an HSP with an a2M receptor, or fragment, or analog, derivative 
or mimetic thereof, in the presence of a test compound; and (b) measuring the amount of heat 
shock protein bound to the <x2M receptor, or fragment, analog, derivative or mimetic thereof, 

25 such that if the amount of bound heat shock protein measured in (b) differs from the amount 
of bound heat shock protein measured in the absence of the test compound, then a compound 
that modulates the binding of an HSP to the a2M receptor is identified. 

In another embodiment, a method for identifying a compound that modulates antigen 
presentation by <x2MR-expressing cells is provided by the invention. In one embodiment, 

30 such a method comprises: (a) adding one or more test compounds to a mixture of a2MR- 
expressing cells and a complex comprising an a2MR ligand and an antigenic molecule, 
under conditions conducive to a2MR-mediated endocytosis; (2) measuring the level of 
stimulation of antigen-specific cytotoxic T cells by the a2MR-expressing cells, such that if 
the level measured in (b) differs from the level of said stimulation in the absence of the one 

35 or more test compounds, then a compound that modulates antigen presentation by a2MR- 
expressing cells is identified. In another embodiment, a test compound is added to a mixture 
of a2MR-expressing cells and a complex consisting essentially of an HSP noncovalently 
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associated with an antigenic molecule, under conditions conducive to a2MR-mediated 
endocytosis; and the level of stimulation of antigen-specific cytotoxic T cells by the a2MR- 
expressing cells is measured, such that if the level measured differs from the level of said 
stimulation in the absence of the test compound, then a compound that modulates HSP- 

5 mediated antigen presentation by a2MR-expressing cells is identified. 

The assays of the present invention may be first optimized on a small scale (i.e., in 
test tubes), and then scaled up for high-throughput assays. In various embodiments, the in 
vitro screening assays of the present invention may be performed using purified components 
or cell lysates. In other embodiments, the screening assays may be earned out in intact cells' 

10 in culture and in animal models. In accordance with the present invention, test compounds 
which are shown to modulate the activity of the o2M receptor as described herein in vitro, 
will further be assayed in vivo, including cultured cells and animal models to determine if the 
test compound has the similar effects in vivo and to determine the effects of the test 
compound on antigen presentation, cytokine release, intracellular Ca** release, T-cell 

1 5 cytotoxicity, tumor progression, the accumulation or degradation of positive and negative 
regulators, cellular proliferation, etc. 

5.2. 1 a2M RECEPTOR-LIGAND BINDING ASSAYS 

The screening assays, described herein, can be used to identify compounds and 
20 compositions, including peptides and organic, non-protein molecules that interact with the 
a2M receptor, or that modulate the interaction between HSPs and the a2M receptor. 
Recombinant, synthetic, and otherwise exogenous compounds may have binding capacity 
and, therefore, may be candidates for pharmaceutical agents. Alternatively, the proteins and 
compounds include endogenous cellular components which interact with the identified genes 
25 and proteins in vivo. Such endogenous components may provide new targets for 
pharmaceutical and therapeutic interventions. 

Thus, in a preferred embodiment, both naturally occurring and/or synthetic 
compounds (e.g., libraries of small molecules or peptides), may be screened for interacting 
with <x2M receptor and/or modulating a2M receptor activity. In another series of 
30 embodiments, cell lysates or tissue homogenates may be screened for proteins or other 

compounds which bind to one of the normal or mutant a2M receptor genes and <x2M receptor 
polypeptides. 

The screening assays described herein may be used to identify small molecules, 
peptides or proteins, or derivatives, analogs and fragments thereof, that interact with and/or 
35 modulate the interaction of HSPs with the a2M receptor. Such compounds may be used as 
agonists or antagonists of the uptake of a2M receptor ligands, such as HSPs and HSP 
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complexes, by the cell surface receptor. For example, compounds that modulate the a2M 
receptor-ligand interaction include, but are not limited to, compounds that bind to the <x2M 
receptor, thereby either inhibiting (antagonists) or enhancing (agonists) the binding of 
ligands, such as HSPs and HSP complexes, to the receptor, as well as compounds that bind to 

5 the ligand, such as for example, HSPs, thereby preventing or enhancing binding of ligand to 
the receptor. Compounds that affect <x2M receptor gene activity (by affecting <x2M receptor 
gene expression, including molecules, e.g., proteins or small organic molecules, that affect 
transcription or interfere with splicing events so that expression of the full length or truncated 
forms of a2M receptor can be modulated) can also be identified in the screens of the 

10 invention. Further, it should be noted that the assays described can also identify compounds 
that modulate a2M receptor ligand, for example HSP, uptake by a2M receptor (e.g., 
compounds which affect downstream signaling in the a2M receptor signal transduction 
pathway). The identification and use of such compounds which affect signaling events 
downstream of the a2M receptor and thus modulate effects of the receptor on the immune 

15 response are within the scope of the invention. 

Compounds that affect the <x2M receptor gene activity (by affecting the a2M receptor 
gene expression, including molecules, e.g., proteins or small organic molecules, that affect 
transcription or interfere with splicing events so that expression of the full length or the 
truncated form of the a2M receptor can be modulated) can also be identified in the screens of 

20 the invention. However, it should be noted that the assays described can also identify 
compounds that modulate the ct2M receptor signal transduction (e.g., compounds which 
affect downstream signaling events, such as inhibitors or enhancers of endocytic activity 
which is activated by ligand binding to the a2M receptor). The identification and use of such 
compounds which affect signaling events downstream of the a2M receptor and thus 

25 modulate effects of the <x2M receptor on the allergenic response are within the scope of the 
invention. 

The screening assays described herein are designed to detect compounds that 
modulate, i.e. interfere with or enhance, ligand-receptor interactions, including HSP-<x2M 
receptor interactions. As described in detail below, such assays are functional assays, such as 

30 binding assays, that can be adapted to a high-throughput screening methodologies. 

Binding assays can be used to identify compounds that modulate the interaction 
between ligands, for example, HSPs, and the <x2M receptor. In one aspect of the invention 
the screens may be designed to identify compounds that disrupt the interaction between the 
a2M receptor and a ligand, such as, for example, HSPs or peptides derived from an HSP, 

35 a2M, or another a2M receptor ligand. Such compounds will be useful as lead compounds 
for antagonists of HSP~ct2M receptor-related disorders and conditions, such as immune 
disorders, proliferative disorders, and infectious diseases. 
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Binding assays may be performed either as direct binding assays or as competition 
binding assays. In a direct binding assay, a test compound is tested for binding either to the 
a2M receptor or to an a2M receptor ligand, such as an HSP. Then, in a second step, the test 
compound is tested for its ability to modulate the ligand-a2M receptor interaction. 

5 Competition binding assays, on the other hand, assess the ability of a test compound to 
compete with a ligand, ie. an HSP, for binding to the a2M receptor. 

In a direct binding assay, either the ligand and/or the a2M receptor is contacted with a 
test compound under conditions that allow binding of the test compound to the ligand or the 
receptor. The binding may take place in solution or on a solid surface. Preferably, the test 

10 compound is previously labeled for detection. Any detectable compound may be used for 
labeling, such as but not limited to, a luminescent, fluorescent, or radioactive isotope or 
group containing same, or a nonisotopic label, such as an enzyme or dye. After a period of 
incubation sufficient for binding to take place, the reaction is exposed to conditions and 
manipulations that remove excess or non-specifically bound test compound. Typically, it 

1 5 involves washing with an appropriate buffer. Finally, the presence of a ligand-test 
compound (e.g. 9 HSP-test compound) or a the a2M receptor-test compound complex is 
detected. 

In a competition binding assay, test compounds are assayed for their ability to disrupt 
or enhance the binding of the ligand (e.g., HSP) to the a2M receptor. Labeled ligand (e.g., 
20 HSP) may be mixed with the a2M receptor or fragment or derivative thereof, and placed 
under conditions in which the interaction between them would normally occur, with and 
without the addition of the test compound. The amount of labeled ligand (e.g., HSP) that 
binds the <x2M receptor may be compared to the amount bound in the presence or absence of 
test compound. 

25 In a preferred embodiment, to facilitate complex formation and detection, the binding 

assay is carried out with one or more components immoblilized on a solid surface. In 
various embodiments, the solid support could be, but is not restricted to, polycarbonate, 
polystyrene, polypropylene, polyethlene, glass, nitrocellulose, dextran, nylon, 
polyacrylamide and agarose. The support configuration can include beads, membranes, 

30 microparticles, the interior surface of a reaction vessel such as a microtiter plate, test tube or 
other reaction vessel. The immobilization of the <x2M receptor, or other component, can be 
achieved through covalent or non-covalent attachments. In one embodiment, the attachment 
may be indirect, Le. through an attached antibody. In another embodiment, the a2M receptor 
and negative controls are tagged with an epitope, such as glutathione S-transferase (GST) so 

35 that the attachment to the solid surface can be mediated by a commercially available 
antibody such as anti-GST (Santa Cruz Biotechnology). 
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For example, such an affinity binding assay may be performed using a the a2M 
receptor which is immobilized to a solid support. Typically, the non-mobilized component 
of the binding reaction, in this case either ligand (e.g., HSP) or the test compound, is labeled 
to enable detection. A variety of labeling methods are available and may be used, such as 

5 luminescent, chromophore, fluorescent, or radioactive isotope or group containing same, and 
nonisotopic labels, such as enzymes or dyes. In a preferred embodiment, the test compound 
is labeled with a fluorophore such as fluorescein isothiocyanate (FITC, available from Sigma 
Chemicals, St. Louis). 

The labeled test compounds, or ligand (e.g., HSP) plus test compounds, are then 

20 allowed to contact with the solid support, under conditions that allow specific binding to 
occur. After the binding reaction has taken place, unbound and non-specifically bound test 
compounds are separated by means of washing the surface. Attachment of the binding 
partner to the solid phase can be accomplished in various ways known to those skilled in the 
art, including but not limited to chemical cross-linking, non-specific adhesion to a plastic 

15 surface, interaction with an antibody attached to the solid phase, interaction between a ligand 
attached to the binding partner (such as biotin) and a ligand-binding protein (such as avidin 
or streptavidin) attached to the solid phase, and so on. 

Finally, the label remaining on the solid surface may be detected by any detection 
method known in the art. For example, if the test compound is labeled with a fluorophore, a 

20 fluorimeter may be used to detect complexes. 

Preferably, the a2M receptor is added to binding assays in the form of intact cells that 
express the a2M receptor, or isolated membranes containing the a2M receptor. Thus, direct 
binding to the a2M receptor or the ability of a test compound to modulate a ligand-a2M 
receptor complex (e.g., HSP- a2M receptor complex) may be assayed in intact cells in 

25 culture or in animal models in the presence and absence of the test compound. A labeled 
ligand (e.g., HSP) may be mixed with cells that express the <x2M receptor, or to crude 
extracts obtained from such cells, and the test compound may be added. Isolated membranes 
may be used to identify compounds that interact with the a2M receptor. For example, in a 
typical experiment using isolated membranes, cells may be genetically engineered to express 

30 the <x2M receptor. Membranes can be harvested by standard techniques and used in an in 
vitro binding assay. Labeled ligand (e.g., 125 I-labeled HSP) is bound to the membranes and 
assayed for specific activity; specific binding is determined by comparison with binding 
assays performed in the presence of excess unlabeled (cold) ligand. Alternatively, soluble 
a2M receptor may be recombinantly expressed and utilized in non-cell based assays to 

35 identify compounds that bind to the a2M receptor. The recombinantly expressed <x2M 
receptor polypeptides or fusion proteins containing the extracellular domain (ECD) of the 
<x2M receptor, or one or more subdomains thereof, can be used in the non-cell based 
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screening assays. Alternatively, peptides corresponding to one or more of the CDs of the 
a2M receptor, or fusion proteins containing one or more of the CDs of the a2M receptor can 
be used in non-cell based assay systems to identify compounds that bind to the cytoplasmic 
portion of the a2M receptor; such compounds may be useful to modulate the signal 

5 transduction pathway of the a2M receptor. In non-cell based assays the recombinantly 

expressed the a2M receptor is attached to a solid substrate such as a test tube, microtiter well 
or a column, by means well known to those in the art (see Ausubel et aL, supra). The test 
compounds are then assayed for their ability to bind to the ct2M receptor. 

Alternatively, the binding reaction may be carried out in solution. In this assay, the 

1Q labeled component is allowed to interact with its binding partner(s) in solution. If the size 
differences between the labeled component and its binding partner(s) permit such a 
separation, the separation can be achieved by passing the products of the binding reaction 
through an ultrafilter whose pores allow passage of unbound labeled component but not of its 
binding partner(s) or of labeled component bound to its partner(s). Separation can also be 

25 achieved using any reagent capable of capturing a binding partner of the labeled component 
from solution, such as an antibody against the binding partner, a ligand-binding protein 
which can interact with a ligand previously attached to the binding partner, and so on. 

In a one embodiment, for example, a phage library can be screened by passing phage 
from a continuous phage display library through a column containing purified <x2M receptor, 

20 or derivative, analog, fragment, or domain, thereof, linked to a solid phase, such as plastic 
beads. By altering the stringency of the washing buffer, it is possible to enrich for phage that 
express peptides with high affinity for the a2M receptor. Phage isolated from the column 
can be cloned and the affinities of the short peptides can be measured directly. Sequences for 
more than one oligonucleotide can be combined to test for even higher affinity binding to the 

25 ct2M receptor. Knowing which amino acid sequences confer the strongest binding to the 
<x2M receptor, computer models can be used to identify the molecular contacts between the 
<x2M receptor and the test compound. This will allow the design of non-protein compounds 
which mimic those contacts. Such a compound may have the same activity of the peptide 
and can be used therapeutically, having the advantage of being efficient and less costly to 

30 produce. 

In another specific embodiment of this aspect of the invention, the solid support is 
membranes containing the a2M receptor attached to a microtiter dish. Test compounds, for 
example, cells that express library members are cultivated under conditions that allow 
expression of the library members in the microtiter dish. Library members that bind to the 
35 protein (or nucleic acid or derivative) are harvested. Such methods, are described by way of 
example in Parmley and Smith, 1988, Gene 73:305-318; Fowlkes et aL, 1992, 
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BioTechniques 13:422-427; PCT Publication No. WO 94/18318; and in references cited 
hereinabove, 

In another embodiment of the present invention, interactions between the a2M 
receptor or ligand (e.g., HSP) and a test compound may be assayed in vitro. Known or 

5 unknown molecules are assayed for specific binding to the a2M receptor nucleic acids, 
proteins, or derivatives under conditions conducive to binding, and then molecules that 
specifically bind to the a2M receptor are identified. The two components can be measured 
in a variety of ways. One approach is to label one of the components with an easily 
detectable label, place it together with a test components) under conditions that allow 

10 binding to occur, perform a separation step which separates bound labeled component from 
unbound labeled component, and then measure the amount of bound component. In one 
embodiment, the a2M receptor can be labeled and added to a test agent, using conditions that 
allow binding to occur. Binding of the test agent can be determined using polyacrylamide 
gel analysis to compare complexes formed in the presence and absence of the test agent. 

15 In yet another embodiment, binding of ligand (e.g., HSP) to the a2M receptor may be 

assayed in intact cells in animal models. A labeled ligand (e.g., HSP) may be administered 
directly to an animal, with and without a test compound. Uptake of the ligand (e.g., HSP) 
may be measured in the presence and the absence of test compound. For these assays, host 
cells to which the test compound is added may be genetically engineered to express the a2M 

20 receptor and/or ligand (e.g., HSP), which may be transient, induced or constitutive, or stable. 
For the purposes of the screening methods of the present invention, a wide variety of host 
cells may be used including, but not limited to, tissue culture cells, mammalian cells, yeast 
cells, and bacteria. Mammalian cells such as macrophages or other cells that express the 
a2M receptor, i.e., cells of the monocytic lineage, liver parenchymal cells, fibroblasts, 

25 keratinocytes, neuronal cells, and placental syncytiotrophoblasts, may be a preferred cell type 
in which to carry out the assays of the present invention. Bacteria and yeast are relatively 
easy to cultivate but process proteins differently than mammalian cells. 

5.2.2 <x2M RECEPTOR ACTIVITY ASSAYS 

30 After identification of a test compound that interacts with, or modulates the 

interaction of a ligand (e.g., HSP) with a2MR, the test compound can be further 
characterized to measure its effect on a2MR activity and the Ugand-a2MR endocytic 
signaling pathway. For example, the test compound may be characterized by testing its 
effect on ligand (e.g., HSP) /a2MR cellular activity in vivo. Such assays include downstream 

35 signaling assays, antigen presentation assays, assays for antigen-specific activation of 
cytotoxic T cells, and the like. 
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In various embodiments, a candidate compound identified in a primary assay may be 
tested for its effect on innate a2MR signaling activity. For example, downstream signaling 
effects of <x2M jre&eptor activation which can be ass.ay.ed ind^de, but are not limited. to: 
enhanced locoiiloticbknd <&emotWfs of macrophages (Foitester ^'tf/.^I^j.Irrtmunology 
5 50: 251-259), down regulation of proteinase synthesis, and elevation of intracellular calcium, 
inositol phosphates and cyclic AMP (Misra et al 9 1993, Biochem. J., 290:885-891). Other 
innate immune responses that can be tested are release of cytokines (i.e., DL-12, ILlp, 
GMCSF, and TNFa). Thus, as secondary assays, any identified candidate compound can be 
tested for changes in such activities in the presence and absence. 

19 For example, in one embodiment, a chemotaxis assay can be used to further 
characterize a candidate identified by a primary screening assay. It is known that a2M 
modified by protease interaction can induce directional migration of cells towards their 
ligand. A number of techniques can be used to test chemotactic migration in vitro (see, e.g., 
Leonard et aL, 1995, 'Measurement of a and B Chemokines", in Current Protocols in 

15 Immunology, 6.12.1-6.12.28, Ed. Coligane* aL, John Wiley & Sons, Inc. 1995). For 

example, in one embodiment, a candidate compound can be tested for its ability to modulate 
the ability of a2MR to induce migration of cells that express the receptor using a chemokine 
gradient in a multiwell Boyden chemotaxis chamber. In a specific example of this method, a 
serial dilution of a ligand (e.g., an HSP) / a2MR antagonist or agonist test compound 

20 identified in the primary screen is placed in the bottom wells of the Boyden chemotaxis 
chamber. A constant amount of ligand is also added to the dilution series. As a control, at 
least one aliquot contains only ligand (e.g. , HSP). The contribution of the antagonist or 
agonist compound to the chemotactic activity of <x2MR is measured by comparing number of 
migrating cells on the lower surface of the membrane filter of the aliquots containing only 

25 ligand (e.g. 9 HSP), with the number of cells in aliquots containing test compound and ligand 
(e.g., HSP). If addition of the test compound to the ligand (e.g., HSP) solution results in a 
decrease in the number of cells detected the membrane relative to the number of cells 
detected using a solution containing only ligand (e.g., HSP), then an antagonist of ligand 
(e.g., HSP) induction of chemotactic activity of a2MR-expressing cells is identified. 

30 Elevation in intracellular ionized calcium concentration ([Ca 2 "^) is also an indicator 

of a2MR activation (Misra et aL, 1993, supra). Thus, in another embodiment, calcium flux 
assays can be used as secondary screens to further characterize modulators of ligand-a2MR 
interactions. Intracellular calcium ion concentration can be measured in cells that express the 
a2M receptor in the presence of the ligand, in the presence and the absence of a test 

35 compound. For example, calcium mobilization can be detected and measured by flow 
cytometry, by labeling with fluorescent dyes that are trapped intracellularly A fluorescent 
dye such as Indo-1 exhibits a change in emission spectrum upon binding calcium, the ratio of 
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fluorescence produced by the calcium-bound dye to that produce by the unbound dye may be 
used to estimate the intracellular calcium concentration* In a specific embodiment, cells are 
incubated in a cuvette in media containing Indo-1 at 37°C and are excited, and fluorescence 
is measured using a fluorimeter (Photon Technology Corporation, International). The ligand 

5 is added at a specific time point, in the presence and the absence of a test compound, EGTA 
is added to the cuvette to release and chelate total calcium, and the response is measured. 
Binding of ligand results in increased intracellular Ca 2+ concentration in cells that express 
a2MR. An agonist results in a relative increased intracellular Ca 2+ concentration, whereas an 
antagonist results in a relative decreased intracellular Ca 2+ concentration 

lO In other embodiments, antigen-specific response assays may be used to detect the 

effect of a candidate compound on presentation of antigenic molecule by an a2MR ligand, 
for example an HSP or HSP complex. For example, an antigen presentation assay may be 
performed to determine the effect of a compound in vivo on the uptake of complexes capable 
of interacting with the <x2M receptor, e.g., HSP-antigenic molecule complexes, by cells 

15 expressing the a2M receptor. Such re-presentation assays are known in the art, and have 
been described previously (Suto and Srivastava, 1995, Science 269:1585-1588). For 
example, in one embodiment, antigen presenting cells, such as a macrophage cell line (e.g., 
RAW264.7), are mixed with antigen-specific T cells in media, using approximately 10,000 
cells of each type at approximately a 1:1 ratio. Complexes of HSP (10 ng/ml) and a peptide 

20 antigen, as well as test compound, is added to the cells and the culture is incubated for 
approximately 20 hours. Stimulation of T cells may then be measured in the presence and 
absence of test compound. 

In another embodiment, antigen-specific T cell stimulation may be assayed. In one 
embodiment an IFN-y release assay may be used. After washing, cells are fixed, 

25 penneabilized, and reacted with dye-labeled antibodies reactive with human IFN-y (PE- anti- 
IFN-y). Samples are analyzed by flow cytometry using standard techniques. Alternatively, a 
filter immunoassay, ELISA (enzyme linked immunosorbent assay), or enzyme-linked 
immunospot assay (ELISPOT) assay, may be used to detect specific cytokines produced by 
an activated T cell. In one embodiment, for example, a nitrocellulose-backed microtiter plate 

30 is coated with a purified cytokine-specific primary antibody, i.e. t anti-IFN-y, and the plate is 
blocked to avoid background due to nonspecific binding of other proteins. A sample of APC 
cells stimulated with antigen is diluted onto the wells of the microtiter plate. A labeled, e.g. , 
biotin-labeled, secondary anti-cytokine antibody is added. The antibody cytokine complex 
can then be detected, i.e., by enzyme-conjugated streptavidin - cytokine-secreting cells will 

35 appear as "spots" by visual, microscopic, or electronic detection methods. In another 
embodiment, "tetramer staining" assay (Altman et al, 1996, Science 274: 94-96) may be 
used to identify antigen-specific T-cells. For example, an MHC molecule containing a 
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specific peptide antigen, such as a tumor-specific antigen, is multimerized to make soluble 
peptide tetramers and labeled, for example, by complexing to streptavidin. The MHC- 
peptide antigen complex is then mixed with a population of stimulated T cells. Biotin is then 
used to stain T cells which recognize and bind to the MHC-antigen complex. 

5.2.3 COMPOUNDS THAT CAN BE SCREENED IN ACCORDANCE WITH 
THE INVENTION 

The screening assays described herein may be used to identify small molecules, 
peptides or proteins, or derivatives, analogs and fragments thereof, that interact with, or 
modulate the interaction of a ligand (e.g., HSP) with the a2M receptor. The compounds 
which may be screened in accordance with the invention include, but are not limited to small 
molecules, peptides, antibodies and fragments thereof, and other organic compounds (e.g., 
peptidomimetics) that bind to the ECD of the a2M receptor and either inhibit the activity 
triggered by the natural ligand (i.e., antagonists) or mimic the activity triggered by the 
natural ligand (i.e., agonists), as well as small molecules, peptides, antibodies or fragments 
thereof, and other organic compounds. In one embodiment, such compounds include 
sequences of the a2M receptor, such as the ECD of the a2M receptor (or a portion thereof), 
which can bind to and "neutralize" natural ligands, such as HSPs, a2M, LDL, etc. In 
another embodiment, such compounds include ligand sequences, such as HSP sequences 
and/or a2M sequences, which can bind to the active site of the a2M receptor, and block its 
activity. 

Compounds that may be used for screening include, but are not limited to, peptides 
such as, for example, soluble peptides, including but not limited to members of random 
peptide libraries; (see, e.g., Lam et al, 1991, Nature 354:82-84; Houghten et al, 1991, 
Nature 354:84-86), and combinatorial chemistry-derived molecular library made of D- and/or 
L- configuration amino acids, phosphopeptides (including, but not limited to, members of 
random or partially degenerate, directed phosphopeptide libraries; see, e.g., Songyang et al, 
1993, Cell 72:767-778), antibodies (including, but not limited to, polyclonal, monoclonal, 
humanized, anti-idiotypic, chimeric or single chain antibodies, and FAb, F(ab') 2 and FAb 
expression library fragments, and epitope-binding fragments thereof), and small organic or 
inorganic molecules. 

In one embodiment of the present invention, peptide libraries may be Used as a source 
of test compounds that can be used to screen for modulators of <x2MR interactions, such as 
HSP- <x2M receptor. Diversity libraries, such as random or combinatorial peptide or 
nonpeptide libraries can be screened for molecules that specifically bind to the a2M receptor. 
Many libraries are known in the art that can be used, e.g., chemically synthesized libraries, 
recombinant (e.g., phage display libraries), and in vitro translation-based libraries. 
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Examples of chemically synthesized libraries ate described in Fodor et al, 1991, 
Science 251:767-773; Houghten et al, 1991, Nature 354:84-86; Lam et al, 1991, 
Nature 354:82-84; Medynski, 1994, Bio/Technology 12:709-710; Gallop etal, 1994, J. 
Medicinal Chemistry 37(9): 1233-1251; Ohlmeyer et al, 1993, Proc. Natl. Acad. Sci. USA 
5 90:10922-10926; Erb etal, 1994, Proc. Natl. Acad. Sci. USA 91:1 1422-1 1426; Houghten et 
al, 1992, Biotechniques 13:412; Jayawickreme et al, 1994, Proc. Natl. Acad. Sci. USA 
91:1614-1618; Salmon etal, 1993, Proc. Natl. Acad. Sci. USA 90:11708-11712; PCT 
Publication No. WO 93/20242; and Brenner and Lerner, 1992, Proc. Natl. Acad. Sci. USA 
89:5381-5383. 

10 Examples of phage display libraries are described in Scott & Smith, 1990, 

Science 249:386-390; Devlin et al, 1990, Science, 249:404-406; Christian et al, 1992, J. 
Mol. Biol. 227:71 1-718; Lenstra, 1992, J. Immunol. Meth. 152:149-157; Kay et al, 1993, 
Gene 128:59-65; and PCT Publication No. WO 94/18318 dated August 18, 199,4. 

By way of examples of nonpeptide libraries, a benzodiazepine library {see e.g., Bunin 

15 etal, 1994, Proc. Natl. Acad. Sci. USA 91:4708-4712) can be adapted for use. Peptoid 
libraries (Simon et al, 1992, Proc. Natl. Acad. Sci. USA 89:9367-9371) can also be used. 
Another example of a library that can be used, in which the amide functionalities in peptides 
have been permethylated to generate a chemically transformed combinatorial library, is 
described by Ostresh et al (1994, Proc. Natl. Acad. Sci. USA 91:1 1 138-1 1142). 

20 Screening the libraries can be accomplished by any of a variety of commonly known 

methods. See, e.g., the following references, which disclose screening of peptide libraries: 
Parmley & Smith, 1989, Adv. Exp. Med. Biol 251:215-218; Scott & Smith, 1990, 
Science 249:386-390; Fowlkes et al, 1992; BioTechniques 13:422-427; Oldenburg et al, 
1992, Proc. Natl. Acad. Sci. USA 89:5393-5397; Yu et al, 1994, Cell 76:933-945; Staudt et 

25 al, 1988, Science 241:577-580; Bock et al, 1992, Nature 355:564-566; Tuerk et al, 1992, 
Proc. Natl. Acad. Sci. USA 89:6988-6992; Ellington et al, 1992, Nature 355:850-852; U.S. 
Patent No. 5,096,815, US. Patent No. 5,223,409, and U.S. Patent No. 5,198,346, all to 
Ladner et al; Rebar & Pabo, 1993, Science 263:671-673; and PCT Publication No. 
WO 94/18318. 

30 In another embodiment of the present invention, the screening may be performed by 

adding the labeled ligand (e.g., HSP) to in vitro translation systems such as a rabbit 
reticulocyte lysate (RRL) system and then proceeding with in vitro priming reaction. In vitro 
translation-based libraries include but are not limited to those described in PCT Publication 
No. WO 91/05058 dated April 18, 1991; and Mattheakis et al, 1994, Proc. Natl. Acad. Sci. 

35 USA 91:9022-9026. 

Compounds that can be tested and identified methods described herein can include, 
but are not limited to, compounds obtained from any commercial source, including Aldrich 
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(Milwaukee, WI 53233), Sigma Chemical (St. Louis, MO), Fluka Chemie AG (Buchs, 
Switzerland) Fluka Chemical Corp. (Ronkonkoma, NY;), Eastman Chemical Company, Fine 
Chemicals (Kingsport, TN), Boehringer Mannheim GmbH (Mannheim, Germany), Takasago 
(Rockleigh, NJ), SST Corporation (Clifton, NJ), Ferro (Zachary, LA 70791), Riedel-deHaen 

5 Aktiengesellschaft (Seelze, Germany), PPG Industries Inc., Fine Chemicals (Pittsburgh, PA 
1 5272). Further any kind of natural products may be screened using the methods of the 
invention, including microbial, fungal, plant or animal extracts. 

Furthermore, diversity libraries of test compounds, including small molecule test 
compounds, may be utilized. For example, libraries may be commercially obtained from 

10 Specs and BioSpecs B.V. (Rijswijk, The Netherlands), Chembridge Corporation (San Diego, 
CA), Contract Service Company (Dolgoprudny, Moscow Region, Russia), Comgenex USA 
Inc. (Princeton, NJ), Maybridge Chemicals Ltd. (Cornwall PL34 OHW, United Kingdom), 
and Asinex (Moscow, Russia). 

Still further, combinatorial library methods known in the art, can be utilize, including, 

15 but not limited to: biological libraries; spatially addressable parallel solid phase or solution 
phase libraries; synthetic library methods requiring deconvolution; the "one-bead 
one-compound" library method; and synthetic library methods using affinity chromatography 
selection. The biological library approach is limited to peptide libraries, while the other four 
approaches are applicable to peptide, non-peptide oligomer or small molecule libraries of 

20 compounds (Lam,1997, Anticancer Drug Des.l2:145). Combinatorial libraries of test 
compounds, including small molecule test compounds, can be utilized, and may, for 

example, be generated as disclosed in Eichler & Houghten, 1995, Mol. Med. Today 1:174- | 

180; Dolle, 1997, MoL Divers. 2:223-236; and Lam, 1997, Anticancer Drug Des. 12:145- 

167. 

25 Examples of methods for the synthesis of molecular libraries can be found in the art, 

for example in: DeWitt et al, 1993, Proc. Natl. Acad. Sci. USA 90:6909; Erb et al 9 1994, 
Proc. Natl. Acad. Sci. USA 91:11422; Zuckermann et al 9 1994, J. Med. Chem. 37:2678; Cho 
etal, 1993, Science 261:1303; Carrell etal, 1994, Angew. Chem. Int. Ed. Engl. 33:2059; 
Carell etal, 1994, Angew. Chem. Int. Ed. Engl. 33:2061; and Gallop etal, 1994, J. Med. 

30 Chem. 37:1233. 

Libraries of compounds may be presented in solution (eg., Houghten, 1992, 
BioTechniques 13:412-421), or on beads (Lam, 1991, Nature 354:82-84), chips (Fodor, 
1993, Nature 364:555-556), bacteria (U.S. Patent No. 5,223,409), spores (Patent Nos. 
5,571,698; 5,403,484; and 5,223,409), plasmids (Cull et al 9 1992, Proc. Natl. Acad. Sci. 

35 USA 89:1865-1869) or phage (Scott and Smith, 1990, Science 249:386-390; Devlin, 1990, 
Science 249:404-406; Cwirla et ai, 1990, Proc. Natl. Acad. Sci. USA 87:6378-6382; and 
Felici, 1991, J. Mol. Biol. 222:301-310). 
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Screening the libraries can be accomplished by any of a variety of commonly known 
methods. See, e.g., the following references, which disclose screening of peptide libraries: 
Parmley & Smith, 1989, Adv. Exp. Med. Biol. 251 :215-218; Scott & Smith, 1990, 
Science 249:386-390; Fowlkes et al, 1992; BioTechniques 13:422-427; Oldenburg et al, 

5 1992, Proc. Natl. Acad. Sci. USA 89:5393-5397; Yu et al, 1994, Cell 76:933-945; Staudt et 
al, 1988, Science 241:577-580; Bock et al, 1992, Nature 355:564-566; Tuerk et al, 1992, 
Proc. Natl. Acad. Sci. USA 89:6988-6992; Ellington et al, 1992, Nature 355:850-852; U.S. 
■Patent No. 5,096,815, U.S. Patent No. 5,223,409, and U.S. Patent No. 5,198,346^ all to 
Ladnere*a/.; Rebar&Pabo, 1993, Science 263:671-673; and PCT Publication No. 

10 WO 94/18318. 

5.3 IDENTIFICATION OF FRAGMENTS OF THE a2M RECEPTOR AND/OR a2M 
RECEPTOR LIGANDS, SUCH AS HSPS, USEFUL FOR IMMUNOTHERAPY 

The invention also encompasses methods for identifying ligand-binding <x2MR 
fragments (such as "HSP-binding domains"), and analogs, muteins, or derivatives thereof, 
which are capable of binding to, and uptake of, <x2MR ligand-antigenic peptide, such as HSP- 
antigenic peptide complexes. Such ligand-binding <x2MR fragment, e.g. , HSP-binding 
domains, can then be tested for activity in vivo and in vitro using the a2M receptor/ligand 
^ binding assays, described in Section 5.2.1, above. In one embodiment, such a method for 
identifying an a2MR fragment capable of binding a heat shock protein comprises the steps 
of: (a) contacting a heat shock protein with one or more a2MR fragments; and (b) 
identifying an a2MR polypeptide fragment which specifically binds to the heat shock 
protein. 

25 Ligand-binding domains, e.g., HSP-binding domains, of the a2MR capable of 

binding ligand-antigenic peptide complexes, such as HSP-antigenic peptide complexes, and 
can be further tested for activity using either in vivo binding assays, re-presentation assays, or 
CTL assays, such as those described in Section 5.2.2, above. For example, one such method 
for identifying an a2MR fragment capable of inducing an HSP-a2M receptor-mediated 

^ process comprises the steps of: (a) contacting a heat shock protein with cell expressing 
a2MR fragment; and (b) measuring the level of o2MR activity in the cell, such that if the 
level of the HSP-a2M receptor-mediated process or activity measured in (b) is greater than 
the level of <x2MR activity in the absence of the <x2MR fragment, then an a2MR fragment 
capable of inducing an HSP-a2M receptor-mediated process is identified. Depending on 

^ their behavior in such assays, such molecules can be used to either enhance or, alternatively, 
block the function of the receptor when administered or expressed in vivo. For example, 
these assays can be used to identify a2MR HSP-binding domains which can bind HSP- 
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antigen complexes and negatively interfere with their uptake by antigen presenting cells. 
These antagonists could be used to downregulate immune responses which are caused by 
cellular release of HSPs. Alternatively, certain a2MR HSP-binding domains may be used to 
enhance HSP-antigen complex uptake arid signaling. Such agonists could be administered or 

5 expressed in subjects to elicit an immune response against an antigen of interest. 

In another embodiment, the invention encompasses methods for identifying ligand 
fragment, such as HSP fragments, which are capable of binding and being taken up by the 
<x2M receptor ("a2M receptor-binding domains"), and analogs, muteins, or derivatives 
thereof. As described for assays for a2M receptor-related polypeptides described above* 

10 a2 M receptor-binding domains can then be tested for activity in vivo and in vitro using 
the binding assays described in Section 5.2.1, above. For example, one such method for 
identifying a heat shock protein fragment capable of binding an a2M receptor comprises: (a) 
contacting an a2M receptor with one or more heat shock protein fragments; and (b) 
identifying a heat shock protein fragment which specifically binds to the <x2M receptor. 

1 5 Ligand fragments, such as HSP fragments* of interest may be further tested in cells, 

using in vivo binding assays, re-presentatiori assays, or CTL assays, such as those described 
in Section 5.2.2, above. For example, in one embodiment, such a method for identifying a 
heat shock protein fragment capable of inducing an HSP-a2M receptor-mediated process 
comprises: a) contacting an a2M receptor fragment with a cell expressing a heat shock 

20 protein; and b) measuring the level of a2MR activity in the cell, such that if the level of the 
HSP-a2M receptor-mediated process or activity measured in (b) is greater than the level of 
a2MR activity in the absence of said heat shock protein fragment. Alternatively, a2M 
receptor-binding domains which decrease uptake of HSPs could be used to block HSP uptake 
by the a2M receptor. In one embodiment, such HSP fragments comprising a2M receptor- 

25 binding domain sequences could be used to construct recombinant fusion proteins, comprised 
of a heat shock protein a2M receptor-binding domain and an antigenic peptide sequence. 
Such recombinant fusion proteins may be used to elicit an immune response and to treat or 
prevent immune diseases and disorders (Suzue et al, 1997, Proc. Natl. Acad. Sci. U.S.A. 94: 
13146-51). 

30 The a2M receptor fragments, analogs, muteins, and derivatives and/or ligand (e.g., 

HSP) fragments, analogs, muteins, and derivatives of the invention may be produced by 
recombinant DNA techniques, synthetic methods, or by enzymatic or chemical cleavage of 
native <x2M receptor and/or ligands (e.g., HSPs). 

Any eukaryotic cell may serve as the nucleic acid source for obtaining the coding 

35 region of an a2M receptor or a2M receptor ligand (e.g. , HSP) gene. Nucleic acid sequences 
encoding ligand, e.g., HSPs, and or the a2M receptor can be isolated from vertebrate, 
mammalian, as well as primate sources, including humans. Amino acid sequences and 
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nucleotide sequences of naturally occurring hgands, e.g., HSPs, and a2M receptor are 
generally available in sequence databases, such as Genbank. 

The DNA may be obtained by standard procedures known in the art by DNA 
amplification or molecular cloning directly from a tissue, cell culture, or cloned DNA (e.g., a 

5 DNA "library"). Clones derived from genomic DNA may contain regulatory and intron 
DNA regions in addition to coding regions; clones derived from cDNA will contain only 
exon sequences. In a preferred embodiment, DNA can be amplified from genomic or cDNA 
by polymerase chain reaction (PGR) amplification using primers designed from the known 
sequence of an a2M receptor ligand, e.g., HSP, a2M, or other a2MR ligand. The polymerase 

10 chain reaction (PCR) is commonly used for obtaining genes or gene fragments of interest. 
For example, a nucleotide sequence encoding a fragment of any desired length can be 
generated using PCR primers that flank the nucleotide sequence encoding the peptide- 
binding domain. Alternatively, an a2MR ligand, e.g., HSP, a2M, or other a2MR ligand 
receptor gene sequence can be cleaved at appropriate sites with restriction endonuclease(s) if 

15 such sites are available, releasing a fragment of DNA encoding the peptide-binding domain. 
If convenient restriction sites are not available, they may be created in the appropriate 
positions by site-directed mutagenesis and/or DNA amplification methods known in the art 
(see, for example, Shankarappa et al 9 1992, PCR Method Appl. 1:277-278). The DNA 
fragment that encodes a fragment of the ligand (e.g., HSP) or <x2M receptor gene is then 

20 isolated, and ligated into an appropriate expression vector, care being taken to ensure that the 
proper translation reading frame is maintained. Alternatives to isolating the genomic DNA 
include, but are not limited to, chemically synthesizing the gene sequence itself from a 
known sequence or making cDNA to the mRNA which encodes the ligand (e.g., HSP) and/or 
a2M receptor. 

25 Any technique for mutagenesis known in the art can be used to modify individual 

nucleotides in a DNA sequence, for purpose of making amino acid substitution(s) in the 
expressed peptide sequence, or for creating/deleting restriction sites to facilitate further 
manipulations. Such techniques include but are not limited to, chemical mutagenesis, in 
vitro site-directed mutagenesis (Hutchinson, C, et al 9 1978, J. Biol. Chem 253:6551), 

30 ohgonucleotide-directed mutagenesis (Smith, 1985, Ann. Rev. Genet. 19:423-463; Hill et al, 
1987, Methods Enzymol. 155:558-568), PCR-based overlap extension (Ho et al, 1989, Gene 
77:51-59), PCR-based megaprimer mutagenesis (Sarkar et al, 1990, Biotechniques, 8:404- 
407), etc. Modifications can be confirmed by double stranded dideoxy DNA sequencing. 
An alternative to producing a2M receptor and/or ligand (e.g., HSP) fragments by 

35 recombinant techniques is peptide synthesis. For example, a peptide corresponding to a 
portion of an a2M receptor and/or ligand (e.g., HSP) comprising the substrate-binding 
domain, or which binds peptides in vitro, can be synthesized by use of a peptide synthesizer. 
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Conventional peptide synthesis may be used or other synthetic protocols well known in the 
art. 

In addition, analogs and derivatives of ct2M receptor and/or ligand (e.g., HSP) can be 
chemically synthesized. Furthermore, if desired, nonclassical amino acids or chemical 

5 amino acid analogs can be introduced as a substitution or addition into the a2M receptor 
and/or ligand (e.g., HSP) sequence. Non-classical amino acids include but are not limited to 
the D-isomers of the common amino acids, a-amino isobutyric acid, 4-aminobutyric acid, 
Abu, 2-amino butyric acid, y-Abu, e-Ahx, 6-amino hexanoic acid, Aib, 2-amino isobutyric 
acid, 3 -amino propionic acid, ornithine, norleucine, norvaline, hydroxyproline, sarcosine, 

10 citrulline, cysteic acid, t-butylglycine, t-butylalanine, phenylglycine, cyclohexylalanine, P- 
alanine, fluoro-amino acids, designer amino acids such as p-methyl amino acids, Ca-methyl 
amino acids, Na-methyl amino acids, and amino acid analogs in general. 

o2M receptor and/or ligand (e.g., HSP) peptides, or a mutant or derivative thereof, 
may be synthesized by solid-phase peptide synthesis using procedures similar to those 

15 described by Merrifield, 1963, J. Am. Chem. Soc, 85:2149. During synthesis, N-a-protected 
amino acids having protected side chains are added stepwise to a growing polypeptide chain 
linked by its C-terminal and to an insoluble polymeric support i.e., polystyrene beads. The 
peptides are synthesized by linking an amino group of an N-a-deprotected amino acid to an 
a-carboxyl group of an N-a-protected amino acid that has been activated by reacting it with a 

20 reagent such as dicyclohexylcarbodiimide. The attachment of a free amino group to the 
activated carboxyl leads to peptide bond formation. The most commonly used N-a- 
protecting groups include Boc which is acid labile and Fmoc which is base labile. Details of 
appropriate chemistries, resins, protecting groups, protected amino acids and reagents are 
well known in the art and so are not discussed in detail herein (See, Atherton, et ah, 1989, 

25 Solid Phase Peptide Synthesis: A Practical Approach, IRL Press, and Bodanszky, 1993, 
Peptide Chemistry, A Practical Textbook, 2nd Ed., Springer- Verlag). 

Purification of the resulting fragment is accomplished using conventional procedures, 
such as preparative HPLC using gel permeation, partition and/or ion exchange 
chromatography. The choice of appropriate matrices and buffers are well known in the art 

30 and so are not described in detail herein. 

In an alternative embodiment, fragments of an a2M receptor and/or ligand (e.g., 
HSP) may be obtained by chemical or enzymatic cleavage of native or recombinant a2M 
receptor and/or ligand (e.g., HSP) molecules. Specific chemical cleavage can be performed 
by cyanogen bromide, NaBH 4 , acetylation, formylation, oxidation, reduction, metabolic 

35 synthesis in the presence of tunicamycin, etc. . Endoproteases that cleave at specific sites 
can also be used. Such proteases are known in the art, including, but not limited to, trypsin, 
a-chymotrypsin, V8 protease, papain, and proteinase K (see Ausubel et ah, (eds.), in 
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"Current Protocols in Molecular Biology", Greene Publishing Associates and Wiley 
Interscience, New York, 17.4.6-17.4.8). The a2M receptor and/or ligand (eg., HSP) amino 
acid sequence of interest can be examined for the recognition sites of these proteases. An 
enzyme is chosen which can release a peptide-binding domain or peptide-binding fragment. 

5 The a2M receptor and/or ligand (e.g., HSP) molecule is then incubated with the protease, 
under conditions that allow digestion by the protease and release of the specifically 
designated peptide-binding fragments. Alternatively, such protease digestions can be canied 
out blindly, i.e., not knowing which digestion product will contain the peptide-binding 
domain, using specific or general specificity proteases, such as proteinase K or pronase. 

10 Once a fragment is prepared, the digestion products may be purified as described 

above, and subsequently tested for the ability to bind peptide or for immunogenicity. 
Methods for determining the immunogenicity of ct2M receptor ligand (e.g., HSP) complexes 
by cytotoxicity tests are described in Section 5.2.2. 

15 5.4 DRUG DESIGN 

Upon identification of a compound that interacts with a2MR, or modulates the 
interaction of an <x2M receptor ligand, such as an HSP, with the <x2M receptor, such a 
compound can be further investigated to test for an ability to alter the immune response. In 
particular, for example, the compounds identified via the present methods can be further 
20 tested in vivo in accepted animal models of HSP-a2MR-mediated processes and HSP- <x2MR 
related disorders, such as, e.g., immune disorders, proliferative disorders, and infectious 
diseases. 

Computer modeling and searching technologies permit identification of compounds, 
or the improvement of already identified compounds, which can modulate the interaction of 

25 the a2M receptor with its ligand, e.g., an HSP. Having identified such a compound or 
composition, the active sites or regions are identified. Such active sites might typically be 
ligand binding sites. The active site can be identified using methods known in the art 
including, for example, from the amino acid sequences of peptides, from the nucleotide 
sequences of nucleic acids, or from study of complexes of the relevant compound or 

30 composition with its natural ligand. In the latter case, chemical or X-ray crystallographic 
methods can be used to find the active site by finding where on the factor the complexed 
ligand is found. 

Next, the three dimensional geometric structure of the active site is determined. This 
can be done by known methods, including X-ray crystallography, which can determine a 
35 complete molecular structure. On the other hand, solid or liquid phase NMR can be used to 
determine certain intra-molecular distances. Any other experimental method of structure 
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determination can be used to obtain partial or complete geometric structures. The geometric 
structures may be measured with a complexed ligand, natural or artificial, which may 
increase the accuracy of the active site structure determined. 

If an incomplete or insufficiently accurate structure is determined, the methods of 

5 computer based numerical modeling can be used to complete the structure or improve its 
accuracy. Any recognized modeling method may be used, including parameterized models 
specific to particular biopolymers such as proteins or nucleic acids, molecular dynamics 
models based on computing molecular motions, statistical mechanics models based on 
thermal ensembles, or combined models. For most types of models, standard molecular 

lO force fields, representing the forces between constituent atoms and groups, are necessary, and 
can be selected from force fields known in physical chemistry. The incomplete or less 
accurate experimental structures can serve as constraints on the complete and more accurate 
structures computed by these modeling methods. 

Finally, having determined the structure of the active site, either experimentally, by 

15 modeling, or by a combination, candidate modulating compounds can be identified by 
searching databases containing compounds along with information on their molecular 
structure. Such a search seeks compounds having structures that match the determined active 
site structure and that interact with the groups defining the active site. Such a search can be 
manual, but is preferably computer assisted. These compounds found from this search are 

20 potential the a2M receptor-modulating compounds* 

Alternatively, these methods can be used to identify improved modulating 
compounds from an already known modulating compound or ligand. The composition of the 
known compound can be modified and the structural effects of modification can be 
determined using the experimental and computer modeling methods described above applied 

25 to the new composition. The altered structure is then compared to the active site structure of 
the compound to determine if an improved fit or interaction results. In this manner 
systematic variations in composition, such as by varying side groups, can be quickly 
evaluated to obtain modified modulating compounds or ligands of improved specificity or 
activity. 

30 Further experimental and computer modeling methods useful to identify modulating 

compounds based upon identification of the active sites of either the a2M receptor or the 
HSP, and other a2M receptor ligands and their analogs, will be apparent to those of skill in 
the art. 

Examples of molecular modeling systems are the CHARMm and QUANTA 
35 programs (Polygen Corporation, Waltham, MA). CHARMm performs the energy 
minimization and molecular dynamics functions. QUANTA performs the construction, 
graphic modelling and analysis of molecular structure. QUANTA allows interactive 
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construction, modification, visualization, and analysis of the behavior of molecules with each 
other. 

A number of articles review computer modeling of drugs interactive with specific 
proteins, such as Rotivinen et al) 1988, Acta Pharmaceutical Fennica 97:159-166); Ripka 

5 (1988 New Scientist 54-57); McKinaly and Rossmann (1989, Annu. Rev. Pharmacol. 
Toxiciol. 29:111-122); Perry andDavies, OSAR: Quantitative Structure-Activity 
Relationships in Drug Design pp. 189-193 Alan R. Liss, Inc. 1989; Lewis and Dean (1989, 
Proc. R. Soc. Lond. 236:125-140 and 141-162); and, with respect to a model receptor for 
nucleic acid components, Askew et al (1989, J. Am. Chem. Soc. 1 1 1 :1082-1090). Other 

IQ computer programs that screen and graphically depict chemicals are available from 
companies such as BioDesign, Inc. (Pasadena, CA.), Allelix, Inc. (Mississauga, Ontario, 
Canada), and Hypercube, Inc. (Cambridge, Ontario). Although these are primarily designed 
for application to drugs specific to particular proteins, they can be adapted to design of drugs 
specific to regions of DNA or RNA, once that region is identified. 

15 

5.5 DIAGNOSTIC USES 

The a2M receptor is a cell surface protein present on many tissues and cell types 
(Herz et al, 1988, EMBO J. 7:41 19-27; Moestrup et al, 1992, Cell Tissue Res. 269: 375- 
82), that appears to be involved in the specific uptake and re-presentation of a2M receptor 

20 ligands, such as HSPs and HSP- peptide complexes. The ot2M receptor was initially 
identified as a heat shock protein receptor due to its interaction with gp96, which is 
exclusively intracellular and is released as a result of necrotic but not apoptotic cell death. 
Thus, gp96 uptake by the a2M receptor may act as a sensor of necrotic cell death. As such, 
a2M receptor-ligand complexes may be used to detect and diagnose proliferative disorders, 

25 such as cancer, autoimmune disorders and infectious disease. Therefore, <x2M receptor 
proteins, analogues, derivatives, and subsequences thereof, a2M receptor nucleic acids (and 
sequences complementary thereto), and anti-a2M receptor antibodies, have uses in detecting 
and diagnosing such disorders. 

The a2M receptor and ot2M receptor nucleic acids can be used in assays to detect, 

30 prognose, or diagnose immune system disorders that may result in tumorigenesis, 
carcinomas, adenomas etc, and viral disease. 

The molecules of the present invention can be used in assays, such as immunoassays, 
to detect, prognose, diagnose, or monitor various conditions, diseases, and disorders affecting 
<x2M receptor expression, or monitor the treatment thereof. In particular, such an 

35 immunoassay is carried out by a method comprising contacting a sample derived from a 
patient with an HSP-a2M receptor specific antibody under conditions such that 
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immunospecific binding can occur, and detecting or measuring the amount of any 
immunospecific binding by the antibody. In a specific aspect, such binding of antibody, in 
tissue sections, can be used to detect aberrant a2M receptor localization or aberrant (e.g., low 
or absent) levels of a2M receptor. In a specific embodiment, antibody to the <x2M receptor 

5 can be used to assay a patient tissue or serum sample for the presence of the a2M receptor 
where an aberrant level of a2M receptor is an indication of a diseased condition. By 
"aberrant levels," is meant increased or decreased levels relative to that present, or a standard 
level representing that present, in an analogous sample from a portion of the body or from a 
subject not having the disorder. 

10 The immunoassays which can be used include but are not limited to competitive and 

non-competitive assay systems using techniques such as western blots, immunohisto- 
chemistry radioimmunoassays, ELISA, "sandwich" immunoassays, immunoprecipitation 
assays, precipitin reactions, gel diffusion precipitin reactions, immunodiffusion assays, 
agglutination assays, complement-fixation assays, immunoradiometric assays, fluorescent 

15 immunoassays, protein A immunoassays, to name but a few. 

ct2M receptor genes and related nucleic acid sequences and subsequences, including 
complementary sequences, can also be used in hybridization assays. a2M receptor nucleic 
acid sequences, or subsequences thereof, comprising about at least 8 nucleotides, can be used 
as hybridization probes. Hybridization assays can be used to detect, prognose, diagnose, or 

20 monitor conditions, disorders, or disease states associated with aberrant changes in a2M 
receptor expression and/or activity as described supra. In particular, such a hybridization 
assay is carried out by a method comprising contacting a sample containing nucleic acid with 
a nucleic acid probe capable of hybridizing to a2M receptor DNA or RNA, under conditions 
such that hybridization can occur, and detecting or measuring any resulting hybridization. 

25 In specific embodiments, diseases and disorders involving decreased immune 

responsiveness during an infection or malignant disorder can be diagnosed, or their suspected 
presence can be screened for, or a predisposition to develop such disorders can be detected, 
by detecting decreased levels of a2M receptor protein, a2M receptor RNA, or the a2M 
receptor functional activity (e.g., binding to HSP, antibody-binding activity etc.), or by 

30 detecting mutations in a2M receptor RNA, DNA or a2M receptor protein (e.g. 9 

translocations in the a2M receptor nucleic acids, truncations in the <x2M receptor gene or 
protein, changes in nucleotide or amino acid sequence relative to wild-type <x2M receptor) 
that cause decreased expression or activity of a2M receptor. Such diseases and disorders 
include but are not limited to those described in Sections 5.7, 5.8, and 5.9. By way of 

35 example, levels of the a2M receptor protein can be detected by immunoassay, levels of a2M 
receptor RNA can be detected by hybridization assays (e.g., Northern blots, in situ- 
hybridization), <x2M receptor activity can be assayed by measuring binding activities in vivo 
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or in vitro. Translocations, deletions, and point mutations in a2M receptor nucleic acids can 
be detected by Southern blotting, FISH, RFLP analysis, SSCP, PGR using primers, 
preferably primers that generate a fragment spanning at least most of the a2M receptor gene, 
sequencing of a2M receptor genomic DNA or cDNA obtained from the patient, etc. 

5 In a preferred embodiment, levels of <x2M receptor mKNA or protein in a patient 

sample are detected or measured relative to the levels present in an analogous sample from a 
subject not having the malignancy or hyperproliferative disorder. Decreased levels indicate 
that the subject may develop, or have a predisposition to developing, viral infection, 
malignancy, or hyperproliferative disorder. 

lO In another specific embodiment, diseases and disorders involving a deficient immune 

responsiveness resulting in cell proliferation or in which cell proliferation is desirable for 
treatment, are diagnosed, or their suspected presence can be screened for, or a predisposition 
to develop such disorders can be detected, by detecting increased levels of the <x2M receptor 
protein, a2M receptor RNA, or the a2M receptor functional activity (e.g., HSP binding or 

1 5 a2M receptor antibody, etc.), or by detecting mutations in a2M receptor RNA, DNA or 
protein (e.g., translocations in a2M receptor nucleic acids, truncations in the gene or protein, 
changes in nucleotide or amino acid sequence relative to wild-type a2M receptor) that cause 
increased expression or activity of the <x2M receptor. Such diseases and disorders include, 
but are not limited to, those described in Sections 5.7, 5.8, and 5.9. By way of example, 

20 levels of the a2M receptor protein, levels of a2M receptor RNA, a2M receptor binding 

activity, and the presence of translocations or point mutations can be determined as described 
above. 

In a specific embodiment, levels of o2M receptor mRNA or protein in a patient 
sample are detected or measured, relative to the levels present in an analogous sample from a 

25 subject not having the disorder, in which increased levels indicate that the subject has, or has 
a predisposition to, an autoimmune disorder. 

Kits for diagnostic use are also provided, that comprise in one or more containers an 
anti-a2M receptor antibody, and, optionally, a labeled binding partner to the antibody. 
Alternatively, the anti-a2M receptor antibody can be labeled (with a detectable marker, e.g., 

30 a chemiluminescent, enzymatic, fluorescent, or radioactive moiety). A kit is also provided 
that comprises in one or more containers a nucleic acid probe capable of hybridizing to a2M 
receptor RNA. In a specific embodiment, a kit can comprise in one or more containers a pair 
of primers (e.g., each in the size range of 6-30 nucleotides) that are capable of priming 
amplification [e.g. 9 by polymerase chain reaction (see e.g., Innis et al 9 1990, PCR Protocols, 

35 Academic Press, Inc., San Diego, CA), ligase chain reaction (see EP 320,308) use of QP 
replicase, cyclic probe reaction, or other methods known in the art] under appropriate 
reaction conditions of at least a portion of an a2M receptor nucleic acid. A kit can optionally 
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further comprise in a container a predetermined amount of a purified a2M receptor protein or 
nucleic acid, e.g., for use as a standard or control. 

5.6 THERAPEUTIC USES 

5 The invention further encompasses methods for modulating the immune response. 

The ct2M receptor recognizes and transports antigenic peptide complexes (e.g., HSP- 
antigenic peptide complexes) for the purpose of presenting such antigenic molecules to cells 
of the immune system and eliciting an immune response. Thus, the compositions and 
methods of the invention may be used for therapeutic treatment of HSP-a2M receptor-related 

10 disorders and conditions, such as autoimmune diseases, cancer and infectious diseases. In 
particular, as described in detail hereinbelow, recombinant cells comprising a2M receptor 
complexes, such as HSP-antigenic peptide complexes, antibodies and other compounds that 
interact with the <x2M receptor, or modulate the interaction between the a2M receptor and its 
ligands, e.g., HSP, as well as other compounds that modulate HSP-<x2M receptor-mediated 

1 5 processes may be used to elicit, or block, an immune response to treat such HSP- a2M 
receptor-related disorders and conditions. 

5.6.1 THERAPEUTIC USE OF IDENTIFIED AGONISTS AND ANTAGONISTS 

Compounds, such as those identified by screening methods provided herein, that 
interact with the a2M receptor (herein "a2MR"), or modulate the interaction between the 
a2M receptor and its ligand, e.g., HSP, can be useful as therapeutics. Such compounds, 
include, but are not limited to, agonists, antagonists, such as antibodies, antisense RNAs and 
ribozymes Compounds which interfere with ligand (e.g. , HSP) -<x2M receptor interaction 
can be used to block an immune response, and can be used to treat autoimmune responses 
and conditions. Other antibodies, agonists, antagonists, antisense RNAs and ribozymes may 
upregulate ligand (e.g, HSP)-a2MR interaction, activity, or expression, and would enhance 
the uptake of antigen complexes (e.g., HSP-antigen complexes), and therefore be useful in 
stimulating the host's immune system prior to, or concurrent with, the administration of a 
vaccine. Described below are methods and compositions for the use of such compounds in 
the treatment of HSP-cc2M receptor-related disorders, such as immune disorders, proliferative 
disorders, and infectious diseases. 

In one embodiment an antagonist of a2M receptor-ligand (e.g., HSP-ix2M receptor) 
interaction is used to block the immune response. Such antagonists include compounds that 
interfere with binding of a ligand (e.g., an HSP) to the receptor by competing for binding to 
the a2M receptor, the ligand, or the ligand-a2M receptor complex. 



20 



25 



30 



35 



-51- 




WO 01/92474 PCT/US01/18041 

In one embodiment, the antagonist is an antibody specific for the a2M receptor, or a 
fragment thereof which contains the HSP ligand binding site. In another embodiment the 
antagonist is an antibody specific for an HSP, which interferes with binding of the HSP to 
the receptor. 

5 In another embodiment, the antagonist is an peptide which comprises at least 

contiguous 10 amino acids of an HSP sequence. Such a peptide can bind to the ligand 
binding site of the a2M receptor a block the interaction of an HSP or HSP complex. In 
another embodiment, the antagonist is a peptide which comprises at least contiguous 10 
amino acids of a2M sequence, which, like an HSP, can bind to the <x2M receptor and 

JO interfere with the binding and uptake of HSP-antigen complexes. In yet another embodiment, 
the antagonist is a peptide which comprises at least contiguous 10 amino acids of <x2M 
receptor sequence, in particular the ECD of the <x2M receptor (or a portion thereof), which 
can bind to and "neutralize" natural ligands, such as .HSPs, <x2M, LDL, etc. 

Such peptides may be produced synthetically or by using standard molecular biology 

1 5 techniques. Amino acid sequences and nucleotide sequences of naturally occurring <x2M 
receptor ligands, such as <x2M and HSPs, are generally available in sequence databases, such 
as GenBank. Computer programs, such as Entrez, can be used to browse the database, and 
retrieve any amino acid sequence and genetic sequence data of interest by accession number. 
Methods for recombinant and synthetic production of such peptides are described in Sections 

20 5.1.1 and 5.1.2. 

Additionally, compounds, such as those identified via techniques such as those 
described hereinabove, in Section 5.2, that are capable of modulating <x2M receptor gene 
product activity can be administered using standard techniques that are well known to those 
of skill in the art. 

25 

5.6.1.1 COMPETITIVE ANTAGONISTS OF a2MR-LIGAND 
INTERACTIONS 

In one embodiment an antagonist of an a2Mr-ligand {e.g., HSP- a2M receptor) 
interaction is used to block the immune response to an antigen complex, e.g., to treat an auto- 

30 immune disorder. Such antagonists include molecules that interfere with binding by binding 
to the a2M receptor, thereby interfering with binding of a ligand (e.g. 9 HSP) to the receptor. 
An example of this type of competitive inhibitor is an antibody to a2M receptor, or a 
fragment of a2MR which contains an HSP ligand binding site. Another example of a 
competitive antagonist is a2M, or a receptor-binding fragment thereof, which itself binds to 

35 a2MR, thereby blocking the binding and uptake of HSP-antigen complexes by the cell. 

An a2MR-ligand {e.g., HSP) competitive inhibitor can be any type of molecule, 
including but not limited to a protein, nucleic acid or drug. In a preferred embodiment, an 
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HSP-a2M competitive inhibitor is an a2MR-binding or an HSP-binding peptide. Examples 
of such peptides are provided below. 

5.6.1.1.1 o2M RECEPTOR-BINDING PEPTIDES 

5 a Macroglobulin peptides 

In one embodiment of the present invention, an HSP-a2MR competitive antagonist is 
an a macroglobulin, preferably a2M, or a2MR-binding portion thereof. 

Functional expression of a2M or a2MR-binding portions thereof (including 
recombinant expression as a FX fusion protein, processing, purification and refolding) is 

10 preferably carried out as described by Holtet et al y 1994, FEBS Lett. 344:242-246. 

In a specific mode of the embodiment, an a2MR-binding portion of a2M consists of 
or comprises a fragment of the <x2M RBD consisting of at least 10 (continuous) amino acids. 
In other modes of the embodiment, die fragment consists of at least 20, 30, 40, 50, 75 or 100 
amino acids of the RBD. In specific modes of the embodiment, such fragments are not larger 

15 than 27, 138 or 153 amino acids. Most preferred peptides comprise one or both of amino 
acids Lys 1370 and Lys l374 . Such peptides include those consisting of amino acids 1299-1451 
(vRBD in FIG. 13B) (SEQ ID NO:8), 1314-1451 (SEQ ID NO:9) (RBD in FIG. 13B) or 
1366-1392 (SEQ ID NO:10) of the mature a2M protein. Other preferred peptides include 
but are not limited to those consisting of amino acids 1300-1425 (SEQ ID NO:l 1), 1300- 

20 1400 (SEQ ID NO:12), 1300-1380 (SEQ ID NO:13), 1325-1425 (SEQ ID NO:14), 1325- 
1400 (SEQ ID NO:15), 1325-1380 (SEQ ID NO:16), 1350-1425 (SEQ ID NO:17), 1350- 
1400 (SEQ ID NO:18), or 1350-1380 (SEQ ID NO:19) of the mature human a2M protein. 

Derivatives or analogs of a2M or a2MR-binding portions of a2M are also 
contemplated as competitive antagonists of HSP-a2MR complexes. Such derivative or 

25 analogs include but are not limited to those molecules comprising regions that are 
substantially homologous to <x2M, the <x2M RBD or fragments thereof (e.g., in various 
embodiments, at least 60% or 70% or 80% or 90% or 95% identity over an amino acid 
sequence of identical size or when compared to an aligned sequence in which the alignment 
is done by a computer homology program known in the art) or whose encoding nucleic acid 

30 is capable of hybridizing to a coding a2M RBD sequence, under stringent, moderately 
stringent, or nonstringent conditions. In certain specific embodiments, an <x2M derivative is 
a chimeric or fusion protein comprising an a2M protein or a2MR-binding portion thereof 
(preferably consisting of at least 10 amino acids of the a2M RBD comprising Lys 1370 and 
Lys l374 ) joined at its amino- or carboxy-tenninus via a peptide bond to an amino acid 

35 sequence of a different protein. 

In particular, <x2M derivatives can be made by altering <x2M coding sequences by 
substitutions, additions or deletions that provide for functionally equivalent molecules. Due 
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to the degeneracy of nucleotide coding sequences, other DNA sequences which encode 
substantially the same amino acid sequence as a a2M gene may be used in the practice of the 
present invention. These include but are not limited to nucleotide sequences comprising all 
or a2MR-binding portions of <x2M genes which are altered by the substitution of different 

5 codons that encode a functionally equivalent amino acid residue within the sequence, thus 
producing a silent change. Likewise, the a2M derivatives of the invention include, but are 
not limited to, those containing, as a primary amino acid sequence, all or an <x2MR-binding 
portion of the amino acid sequence of an <x2M protein, including altered sequences in which 
functionally equivalent amino acid residues are substituted for residues within the sequence 

10 resulting in a silent change. For example, one or more amino acid residues within the 
sequence can be substituted by another amino acid of a similar polarity which acts as a 
functional equivalent, resulting in a silent alteration. Substitutes for an amino acid within the 
sequence may be selected from other members of the class to which the amino acid belongs. 
For example, the nonpolar (hydrophobic) amino acids include alanine, leucine, isoleucine, 

15 valine, proline, phenylalanine, tryptophan and methionine. The polar neutral amino acids 
include glycine, serine, threonine, cysteine, tyrosine, asparagine, and glutamine. The 
positively charged (basic) amino acids include arginine, lysine and histidine. The negatively 
charged (acidic) amino acids include aspartic acid and glutamic acid. 

The <x2M derivatives and analogs of the invention can be produced by various 

2o methods known in the art. The manipulations which result in their production can occur at 
the gene or protein level. For example, the cloned a2M gene sequence can be modified by 
any of numerous strategies known in the art (Maniatis, T., 1990, Molecular Cloning, A 
Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory, Cold Spring Harbor, New 
York). The sequence can be cleaved at appropriate sites with restriction endonuclease(s), 

25 followed by further enzymatic modification if desired, isolated, and ligated in vitro. In the 
production of the gene encoding a derivative or analog of a2M, care should be taken to 
ensure that the modified gene remains within the same translational reading frame as a2M, 
uninterrupted by translational stop signals, in the gene region where the desired a2M activity 
is encoded. 

30 Manipulations of the a2M sequence may also be made at the protein level. Included 

within the scope of the invention are a2M protein fragments or other derivatives or analogs 
which are differentially modified during or after translation, e.g., by glycosylation, 
acetylation, phosphorylation, amidation, derivatization by known protectingftlocking groups, 
proteolytic cleavage, linkage to an antibody molecule or other cellular ligand, etc. Any of 

35 numerous chemical modifications may be carried out by known techniques, including but not 
limited to specific chemical cleavage by cyanogen bromide, trypsin, chymotrypsin, papain, 
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V8 protease, NaBH 4 ; acetylation, formylation, oxidation, reduction; metabolic synthesis in 
the presence of tunicamycin; etc. 

In addition, analogs and derivatives of a2M can be chemically synthesized. For 
example, an a2MR-binding portion of a2M can be synthesized by use of a peptide 

5 synthesizer. Furthermore, if desired, nonclassical amino acids or chemical amino acid 

analogs can be introduced as a substitution or addition into the a2M sequence. Non-classical 
amino acids include but are not limited to the D-isomers of the common amino acids, a- 
amino isobutyric acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, y- Abu, e-Ahx, 
6-amino hexanoic acid, Aib, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, 

\0 norleucine, norvaline, hydroxyproline, sarcosine, citrulline, cysteic acid, t-butylglycine, t- 
butylalanine, phenylglycine, cyclohexylalanine, p-alanine, fliioro-amino acids, designer 
amino acids such as 0-methyl amino acids, Ca-methyl amino acids, Na-methyl amino acids, 
and amino acid analogs in general. Furthermore, the amino acid can be D (dextrorotary) or L 
(levorotary). 

1 5 In other specific modes of the embodiment, an HSP-a2MR competitive antagonist is 

another a macroglobulin or a2MR-binding portion thereof, for example an a macroglobulin 
RBD domain selected from Nielsen et al., supra, Fig. 3, Group A. 

RAP 

20 In one embodiment of the present invention, an HSP-a2MR competitive antagonist is 

a2MR-associated protein (RAP) (Genbank accession no. A39875) or an a2MR-binding 
portion thereof. In a specific mode of the embodiment, an a2MR-binding portion of RAP 
consists of or comprises a fragment of the RAP RBD consisting of at least 10 (continuous) 
amino acids. In other modes of the embodiment, the fragment consists of at least 20, 30, 40, 

25 50, 75 or 100 amino acids of the RBD. In specific modes of the embodiment, such fragments 
are not larger than 28, 50 or 100 amino acids. In other specific modes of the embodiment, an 
o2MR-binding portion of RAP comprises an a2MR-binding portion of domain 1 or 3, e.g. as 
depicted in Nielsen et al, supra, Fig. 3, Group D or E. Expression of recombinant RAP or 
an a2MR-binding portion thereof, e.g. domain 1 or 3, is preferably achieved as described by 

30 Andersen et al 9 supra). 

5 .6. 1 . 1 .2 HSP-BINDDSfG PEPTIDES 

a2MR peptides 

In one embodiment of the present invention, an HSP-a2MR competitive antagonist is 
35 a2MR peptide, preferably a soluble peptide, that can bind to HSPs and therefore 
competitively inhibit HSP binding to the native receptor. 



-55- 



WO 01/92474 



PCT7US01/18041 



Functional expression of HSP-binding portions of a2MR is preferably carried out as 
described for the CR8 domain by Huang et aL 9 1999, J. Biol. Chem 274:14130-14136. 
Briefly, to maintain proper folding, the protein is expressed as a GST fusion, expressed 
recombinantly, the GST portion cleaved, uncleaved protein removed on GSH-Sepharose, and 

5 cleaved protein refolded. Since the complement repeats bind to calcium, proper folding is 
assayed by measuring the binding of the refolded protein to calcium. 

In a specific mode of the embodiment, an HSP-binding portion of <x2MR consists of 
or comprises at least one complement repeat, most preferably selected from CR3-CR10. In 
another specific mode of the embodiment, an HSP-binding portion of <x2MR comprises a 

I q cluster of complement repeats, most preferably CHL In other modes of the embodiment, the 
HSP-binding portion consists of at least 10, more preferably at least 20, yet more preferably 
at least 30, yet more preferably at least 40, and most preferably at least 80 (continuous) 
amino acids. In specific modes of the embodiment, such fragments are not larger than 40-45 
amino acids. In other specific modes of the embodiment, such fragments are not larger than 

15 80-90 amino acids. Exemplary preferred peptides include but are not limited to those 

consisting of amino acids 25-68 (SEQ ID NO:20), 25-110 (SEQ ID NO:21), 68-110 (SEQ ID 
NO:22), 853-894 (SEQ ID NO:23), 853-934 (SEQ ID NO:24), 853-974 (SEQ ID NO:25), 
853-1013 (SEQ ID NO:26), 853-1060 (SEQ ID NO:27), 853-1102 (SEQ ID NO:28), 853- 
1183 (SEQ ID NO:29), 895-934 (SEQ ID NO:30), 895-974 (SEQ ID NO:31), 895-1013 

20 (SEQ ID NO:32), 895-1060 (SEQ ID NO:33), 895-1 102 (SEQ ID NO:34), 895-1 183 (SEQ 
ID NO;35), 935-974 (SEQ ID NO:36), 9354013 (SEQ ID NO:37), 935-1060 (SEQ ID 
NO:38), 935-1 102 (SEQ ID NO:39), 935-1 183 (SEQ ID NO:40), 975-1013 (SEQ ID 
NO:41), 975-1060 (SEQ ID NO:42), 975-1 143 (SEQ ID NO:43), 975-1183 (SEQ ID 
NO:44), 1014-1060 (SEQ ID NO:45), 1014-1102 (SEQ ID NO:46), 1014-1183 (SEQ ID 

25 NO:47), 1061-1 102 (SEQ ID NO;48), 1061-1 143 (SEQ ID NO:49), 1061-1 183 (SEQ ID 
NO:50), 1103-1143 (SEQIDNO:51), 1103-1183 (SEQ IDNO:52), or 1144-1183 (SEQ ID 
NO:53) of human <x2MR. 

Derivatives or analogs of HSP-binding portions <x2MR also contemplated as 
competitive antagonists of HSP-a2MR complexes. Such derivative or analogs include but 

30 are not limited to those molecules comprising regions that are substantially homologous to 
the extracellular domain of a2MR or fragments thereof (e.g., in various embodiments, at 
least 60% or 70% or 80% or 90% or 95% identity over an amino acid sequence of identical 
size or when compared to an aligned sequence in which the alignment is done by a computer 
homology program known in the art) or whose encoding nucleic acid is capable of 

35 hybridizing to a sequence encoding an <x2MR HSP-binding sequence, under stringent, 

moderately stringent, or nonstringent conditions. In certain specific embodiments, an a2MR 
derivative is a chimeric or fusion protem comprising an HSP-binding portion of a2MR, 



-56- 



WO 01/92474 



PCIYUS01/18041 



preferably consisting of at least one complement repeat of Cl-II) joined at its amino- or 
carboxy-terminus via a peptide bond to an amino acid sequence of a different protein. Such a 
chimeric protein can be produced recombinantly as described above, by omitting the 
cleavage repurification steps. 

5 Other HSP-binding o2MR derivatives can be made by altering a2MR coding 

sequences by substitutions, additions or deletions that provide for functionally equivalent 
molecules. Due to the degeneracy of nucleotide coding sequences, other DNA sequences 
which encode substantially the same amino acid sequence as an HSP-binding a2MR gene or 
gene fragment may be used in the practice of the present invention. Selection of suitable 

I o alterations and production of HSP-binding a2MR derivatives can be made applying the same 
principles described above for a2M derivatives and using the general methods described in 
Sections 5.1.1 and 5.1.2. 

. HSP peptides 

25 In another mode of the embodiment, the antagonist is an peptide which comprises at 

least contiguous 10 amino acids of an HSP sequence. Such a peptide can bind to the ligand 
binding site of the ct2M receptor a block the interaction of an HSP or HSP complex. . 

Such peptides may be produced synthetically or by using standard molecular biology 
techniques. Amino acid sequences and nucleotide sequences of naturally occurring HSPs are 

20 generally available in sequence databases, such as GenBank. Computer programs, such as 
Entrez, can be used to browse the database, and retrieve any amino acid sequence and genetic 
sequence data of interest by accession number. Methods for recombinant and synthetic 
production of such peptides are described in Sections 5.1.1 and 5.1.2. 

Additionally, compounds, such as those identified via techniques such as those 

25 described hereinabove, in Section 5.2, that are capable of modulating a2M receptor gene 
product activity can be administered using standard techniques that are well known to those 
ofskillinthe art. 

5.6.2 THERAPEUTIC USE OF THE a2M RECEPTOR AGAINST CANCER 
AND INFECTIOUS DISEASES 

30 

In another embodiment, symptoms of certain a2M receptor gene disorders, such as 
autoimmune disorders, or proliferative or differentiative disorders causing tumorigenesis or 
cancer, may be ameliorated by modulating the level of a2M receptor gene expression and/or 
<x2M receptor gene product activity. In one embodiment, for example, a decrease in a2M 
35 receptor gene expression may be useful to decrease <x2M receptor activity, and ameliorate the 
symptoms of an autoimmune disorder. In this case, the level of <x2M receptor gene 
expression may be decreased by using a2M receptor gene sequences in conjunction with 
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well-known antisense, gene "knock-out," ribozyme and/or triple helix methods. In another 
embodiment, an increase in a2M receptor gene expression may be desired to compensate for 
a mutant or impaired gene in an HSP-a2M receptor-mediated pathway, and to ameliorate the 
symptoms of an HSP- a2M receptor-related disorder. 

5 Among the compounds that may exhibit the ability to modulate the activity, 

expression or synthesis of the a2M receptor gene, including the ability to ameliorate the 
symptoms of an HSP-a2M receptor related disorder are antisense, ribozyme, and triple helix 
molecules. Such molecules may be designed to reduce or inhibit either unimpaired, or if 
appropriate, mutant target gene activity. Techniques for the production and use of such 

20 molecules are well known to those of skill in the art. 

Antisense RNA and DNA molecules act to directly block the translation of mRNA by 
hybridizing to targeted mRNA and preventing protein translation. Antisense approaches 
involve the design of oligonucleotides that are complementary to a target gene mRNA. The 
antisense oligonucleotides will bind to the complementary target gene mRNA transcripts and 

15 prevent translation. Absolute complementarity, although preferred, is not required. 

A sequence "complementary" to a portion of an RNA, as referred to herein, means a 
sequence having sufficient complementarity to be able to hybridize with the RNA, forming a 
stable duplex; in the case of double-stranded antisense nucleic acids, a single strand of the 
duplex DNA may thus be tested, or triplex formation may be assayed. The ability to 

20 hybridize will depend on both the degree of complementarity and the length of the antisense 
nucleic acid. Generally, the longer the hybridizing nucleic acid, the more base mismatches 
with an RNA it may contain and still form a stable duplex (or triplex, as the case may be). 
One skilled in the art can ascertain a tolerable degree of mismatch by use of standard 
procedures to determine the melting point of the hybridized complex. 

25 In one embodiment, oligonucleotides complementary to non-coding regions of the 

<x2M receptor gene could be used in an antisense approach to inhibit translation of 
endogenous <x2M receptor mRNA. Antisense nucleic acids should be at least six nucleotides 
in length, and are preferably oligonucleotides ranging from 6 to about 50 nucleotides in 
length. In specific aspects the oligonucleotide is at least 10 nucleotides, at least 17 

30 nucleotides, at least 25 nucleotides or at least 50 nucleotides. 

In an embodiment of the present invention, oligonucleotides complementary to the 
nucleic acids encoding the HSP receptor ligand binding domain are used. 

Regardless of the choice of target sequence, it is preferred that in vitro studies are 
first performed to quantitate the ability of the antisense oligonucleotide to inhibit gene 

35 expression. It is preferred that these studies utilize controls that distinguish between antisense 
gene inhibition and nonspecific biological effects of oligonucleotides. It is also preferred that 
these studies compare levels of the target RNA or protein with that of an internal control 
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RNA or protein. Additionally, it is envisioned that results obtained using the antisense 
oligonucleotide are compared with those obtained using a control oligonucleotide. It is 
preferred that the control oligonucleotide is of approximately the same length as the test 
oligonucleotide and that the nucleotide sequence of the oligonucleotide differs from the 
5 antisense sequence no more than is necessary to prevent specific hybridization to the target 
sequence. 

The oligonucleotides can be DNA or RNA or chimeric mixtures or derivatives or 
modified versions thereof, single-stranded or double-stranded. The oligonucleotide can be 
modified at the base moiety, sugar moiety, or phosphate backbone, for example, to improve 

!0 stability of the molecule, hybridization, etc. The oligonucleotide may include other 

appended groups such as peptides (e.g., for targeting host cell receptors in vivo), or agents 
facilitating transport across the cell membrane (see, e.g., Letsinger et al, 1989, Proc. Natl. 
Acad. Sci. U.S.A. 86, 6553-6556; Lemaitre etal, 1987, Proc. Natl. Acad. Sci. 84, 648-652; 
PCT Publication No. WO88/09810, published December 15, 1988) or the blood-brain barrier 

15 (see, e.g., PCT Publication No. WO89/10134, published April 25, 1988), hybridization- 
triggered cleavage agents (see, e.g., Krol et al, 1988, BioTechniques 6, 958-976) or 
intercalating agents (see, e.g., Zon, 1988, Pharm. Res. 5, 539-549). To this end, the 
oligonucleotide may be conjugated to another molecule, e.g., a peptide, hybridization 
triggered cross-linking agent, transport agent, hybridization-triggered cleavage agent, etc. 

20 The antisense oligonucleotide may comprise at least one modified base moiety which 

is selected from the group including but not limited to 5-fluorouracil, 5-bromouracil, 
5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 
5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 
5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 

25 N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 

2- methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta- 
D-mannosiylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 
isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 

30 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil- 
5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino- 

3- N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 

The antisense oligonucleotide may also comprise at least one modified sugar moiety 
selected from the group including but not limited to arabinose, 2-fluoroarabinose, xylulose, 
35 andhexose. 

Li yet another embodiment, the antisense oligonucleotide comprises at least one 
modified phosphate backbone selected from the group consisting of a phosphorothioate (S- 
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ODNs), a phosphorodithioate, a phosphoramidothioate,- a phosphoramidate, a 
phosphordiamidate, a methylphosphonate, an alkyl phosphotriester, and a formacetal or 
analog thereof. 

In yet another embodiment, the antisense oligonucleotide is an a-anomeric 
5 oligonucleotide. An a-anomeric oligonucleotide forms specific double-stranded hybrids with 
complementary RNA in which, contrary to the usual p-units, the strands run parallel to each 
other (Gautier et al. 9 1987, Nucl. Acids Res. 15, 6625-6641). The oligonucleotide is a 2'-0- 
methylribonucleotide (Inoue et aL, 1987, Nucl. Acids Res. 15, 6131-6148), or a chimeric 
RNA-DNA analogue (Inoue et al, 1987, FEBS Lett. 215, 327-330). 
10 Oligonucleotides of the invention may be synthesized by standard methods known in 

the art, e.g. by use of an automated DNA synthesizer (such as are commercially available 
from Biosearch, Applied Biosystems, etc.). As examples, phosphorothioate oligonucleotides 
may be synthesized by the method of Stein et al (1988, Nucl. Acids Res. 16, 3209), 
methylphosphonate oligonucleotides can be prepared by use of controlled pore glass polymer 
15 supports (Sarin et al, 1988, Proc. Natl. Acad. Sci. U.S.A. 85, 7448-7451), etc. 



20 achieved because specific target mRNAs are digested by RNAse H after they have 

hybridized with the antisense phosphorothioate oligonucleotides (S-ODNs). Since no rules 
exist to predict which antisense S-ODNs will be more successful, the best strategy is 
completely empirical and consists of trying several antisense S-ODNs. Antisense 
phosphorothioate oligonucleotides (S-ODNs) will be designed to target specific regions of 

25 mRNAs of interest. Control S-ODNs consisting of scrambled sequences of the antisense S- 
ODNs will also be designed to assure identical nucleotide content and minimize differences 
potentially attributable to nucleic acid content. All S-ODNs can be synthesized by Oligos 
Etc. (Wilsonville, OR). In order to test the effectiveness of the antisense molecules when 
applied to cells in culture, such as assays for research purposes or vivo gene therapy 

30 protocols, cells will be grown to 60-80% confluence on 100 mm tissue culture plates, rinsed 
with PBS and overlaid with lipofection mix consisting of 8 ml Opti-MEM, 52.8 
Lipofectin, and a final concentration of 200 nM S-ODNs. Lipofections will be carried out 
using Lipofectin Reagent and Opti-MEM (Gibco BRL). Cells will be incubated in the 
presence of the lipofection mix for 5 hours. Following incubation the medium will be 

3 5 replaced with complete DMEM. Cells will be harvested at different time points post- 
lipofection and protein levels will be analyzed by Western blot. 



While antisense nucleotides complementary to the target gene coding region 
sequence could be used, those complementary to the transcribed, untranslated region are 
most preferred. 

In one embodiment of the present invention, gene expression downregulation is 
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Antisense molecules should be targeted to cells that express the target gene, either 
directly to the subject in vivo or to cells in culture, such as in ex vivo gene therapy protocols. 
A number of methods have been developed for delivering antisense DNA or RNA to cells; 
e.g 9 antisense molecules can be injected directly into the tissue site, or modified antisense 

5 molecules, designed to target the desired cells (e.g., antisense linked to peptides or antibodies 
that specifically bind receptors or antigens expressed on the target cell surface) can be 
administered systemically. 

However, it is often difficult to achieve intracellular concentrations of the antisense 
sufficient to suppress translation of endogenous mRNAs. Therefore a preferred approach 

jq utilizes a recombinant DNA construct in which the antisense oligonucleotide is placed under 
the control of a strong pol III or pol II promoter. The use of such a construct to transfect 
target cells in the patient will result in the transcription of sufficient amounts of single 
stranded RNAs that will form complementary base pairs with the endogenous target gene 
transcripts and thereby prevent translation of the target gene mRNA. For example, a vector 

1 5 can be introduced e.g. , such that it is taken up by a cell and directs the transcription of an 
antisense RNA. Such a vector can remain episomal or become chromosomally integrated, as 
long as it can be transcribed to produce the desired antisense RNA. Such vectors can be 
constructed by recombinant DNA technology methods standard in the art. Vectors can be 
plasmid, viral, or others known in the art, used for replication and expression in mammalian 

20 cells. Expression of the sequence encoding the antisense RNA can be by any promoter 
known in the art to act in mammalian, preferably human cells. Such promoters can be 
inducible or constitutive. Such promoters include but are not limited to: the SV40 early 
promoter region (Bernoist and Chambon, 1981, Nature 290, 304-310), the promoter 
contained in the 3' long terminal repeat of Rous sarcoma virus (Y amamoto et aL 9 1980, Cell 

25 22, 787-797), the herpes thymidine kinase promoter (Wagner et al, 1981, Proc. Natl. Acad. 
Sci. U.S.A. 78, 1441-1445), the regulatory sequences of the metallothionein gene (Brinster et 
aL, 1982, Nature 296, 39-42), etc. Any type of plasmid, cosmid, YAC or viral vector can be 
used to prepare the recombinant DNA construct which can be introduced directly into the 
tissue site. Alternatively, viral vectors can be used that selectively infect the desired tissue, 

30 in which case administration may be accomplished by another route (e.g., systemically), 

Ribozyme molecules designed to catalytically cleave target gene mRNA transcripts 
can also be used to prevent translation of target gene mRNA and, therefore, expression of 
target gene product. (See, e.g., PCT International Publication WO90/1 1364, published 
October 4, 1990; Sarver et al, 1990, Science 247, 1222-1225). In an embodiment of the 

35 present invention, oligonucleotides which hybridize to the HSP receptor gene are designed to 
be complementary to the nucleic acids encoding the HSP receptor ligand binding domain. 
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Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage 
of RNA. (For a review, see Rossi, 1994, Current Biology 4, 469-471). The mechanism of 
ribozyme action involves sequence specific hybridization of the ribozyme molecule to 
complementary target RNA, followed by an endonucleolytic cleavage event. The 

5 composition of ribozyme molecules must include one or more sequences complementary to 
the target gene mRNA, and must include the well known catalytic sequence responsible for 
mRNA cleavage. For this sequence, see, e.g. 9 U.S. Patent No. 5,093,246, which is 
incorporated herein by reference in its entirety. 

While ribozymes that cleave mRNA at site specific recognition sequences can be 

IQ used to destroy target gene mRNAs, the use of hammerhead ribozymes is preferred. 

Hammerhead ribozymes cleave mRNAs at locations dictated by flanking regions that form 
complementary base pairs with the target mRNA. The sole requirement is that the target 
mRNA have the following sequence of two bases: 5-UG-3'. The construction and 
production of hammerhead ribozymes is well known in the art and is described more fully in 

15 Myers, 1995, Molecular Biology and Biotechnology: A Comprehensive Desk Reference, 
VCH Publishers/New York, (see especially fig. 4, p. 833).and inHaseloff & Gerlach, 1988, 
Nature, 334, 585-591, which is incorporated herein by reference in its entirety. 

Preferably the ribozyme is engineered so that the cleavage recognition site is located 
near the 5' end of the target gene mRNA, i.e., to increase efficiency and minimize the 

20 intracellular accumulation of non-functional mRNA transcripts. 

The ribozymes of the present invention also include RNA endoribonucleases 
(hereinafter "Cech-type ribozymes") such as the one that occurs naturally in Tetrahymena 
thermophila (known as the IVS, or L-19 IVS RNA) and that has been extensively described 
by Thomas Cech and collaborators (Zaug et aL, 1984, Science, 224, 574-578; Zaug and 

25 Cech, 1986, Science, 231, 470-475; Zaug et al, 1986, Nature, 324, 429-433; published 
International patent application No. WO 88/04300 by University Patents Inc.; Been & Cech, 
1986, Cell, 47, 207-216). The Cech-type ribozymes have an eight base pair active site which 
hybridizes to a target RNA sequence whereafter cleavage of the target RNA takes place. The 
invention encompasses those Cech-type ribozymes which target eight base-pair active site 

30 sequences that are present in the target gene. 

As in the antisense approach, the ribozymes can be composed of modified 
oligonucleotides (e.g., for improved stability, targeting, etc.) and should be delivered to cells 
that express the target gene in vivo, A preferred method of delivery involves using a DNA 
construct "encoding" the ribozyme under the control of a strong constitutive pol EI or pol II 

35 promoter, so that transfected cells will produce sufficient quantities of the ribozyme to 
destroy endogenous target gene messages and inhibit translation. Because ribozymes unlike 
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antisense molecules, are catalytic, a lower intracellular concentration is required for 
efficiency. 

Endogenous target gene expression can also be reduced by inactivating or "knocking 
out" the target gene or its promoter using targeted homologous recombination {e.g., see 

5 Smithies et aL, 1985, Nature 317, 230-234; Thomas & Capecchi, 1987, Cell 51, 503-5 12; 
Thompson et al, 1989, Cell 5, 313-321; each of which is incorporated by reference herein in 
its entirety). For example, a mutant, non-functional target gene (or a completely unrelated 
DNA sequence) flanked by DNA homologous to the endogenous target gene (either the 
coding regions or regulatory regions of the target gene) can be used, with or without a 

10 selectable marker and/or a negative selectable marker, to transfect cells that express the target 
gene in vivo. Insertion of the DNA construct, via targeted homologous recombination, 
results in inactivation of the target gene. Such approaches are particularly suited 
modifications to ES (embryonic stem) cells can be used to generate animal offspring with an 
inactive target gene {e.g., see Thomas & Capecchi, 1987 and Thompson, 1989, supra). 

2 5 However this approach can be adapted for use in humans provided the recombinant DNA 
constructs are directly administered or targeted to the required site in vivo using appropriate 
viral vectors. 

Alternatively, endogenous target gene expression can be reduced by targeting 
deoxyribonucleotide sequences complementary to the regulatory region of the target gene 

20 (i.e., the target gene promoter and/or enhancers) to form triple helical structures that prevent 
transcription of the target gene in target cells in the body. (See generally, Helene, 1991, 
Anticancer Drug Des., 6(6), 569-584; Helene et aL, 1992, Ann. N.Y. Acad. Sci., 660, 27-36; 
and Maher, 1992, Bioassays 14(12), 807-815). 

Nucleic acid molecules to be used in triple helix formation for the inhibition of 

25 transcription should be single stranded and composed of deoxyribonucleotides. The base 
composition of these oligonucleotides must be designed to promote triple helix formation via 
Hoogsteen base pairing rules, which generally require sizeable stretches of either purines or 
pyrimidines to be present on one strand of a duplex. Nucleotide sequences may be 
pyrimidine-based, which will result in TAT and CGC + triplets across the three associated 

30 strands of the resulting triple helix. The pyrimidine-rich molecules provide base 
complementarity to a purine-rich region of a single strand of the duplex in a parallel 
orientation to that strand. In addition, nucleic acid molecules may be chosen that are purine- 
rich, for example, contain a stretch of G residues. These molecules will form a triple helix 
with a DNA duplex that is rich in GC pairs, in which the majority of the purine residues are 

35 located on a single strand of the targeted duplex, resulting in GGC triplets across the three 
strands in the triplex. 
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Alternatively, the potential sequences that can be targeted for triple helix formation 
may be increased by creating a so called "switchback 11 nucleic acid molecule. Switchback 
molecules are synthesized in an alternating 5'-3\ 3' -5* manner, such that they base pair with 
first one strand of a duplex and then the other, eliminating the necessity for a sizeable stretch 

5 of either purines or pyrimidines to be present on one strand of a duplex. 

In instances wherein the antisense, ribozyme, and/or triple helix molecules described 
herein are utilized to inhibit mutant gene expression, it is possible that the technique may so 
efficiently reduce or inhibit the transcription (triple helix) and/or translation (antisense, 
ribozyme) of mRNA produced by normal target gene alleles that the possibility may arise 

10 wherein the concentration of normal target gene product present may be lower than is 

necessary for a normal phenotype. In such cases, to ensure that substantially normal levels of 
target gene activity are maintained, therefore, nucleic acid molecules that encode and express 
target gene polypeptides exhibiting normal target gene activity may, be introduced into cells 
via gene therapy methods such as those described, below, in Section 5.6.3 that do not contain 

1 5 sequences susceptible to whatever antisense, ribozyme, or triple helix treatments are being 
utilized. Alternatively, in instances whereby the target gene encodes an extracellular protein, 
it may be preferable to co-administer normal target gene protein in order to maintain the 
requisite level of target gene activity. 

Anti-sense RNA and DNA, ribozyme, and triple helix molecules of the invention may 

20 be prepared by any method known in the art for the synthesis of DNA and RNA molecules, 
as discussed above. These include techniques for chemically synthesizing oligodeoxyri- 
bonucleotides and oligoribonucleotides well known in the art such as for example solid phase 
phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated by in 
vitro and in vivo transcription of DNA sequences encoding the antisense RNA molecule. 

25 Such DNA sequences may be incorporated into a wide variety of vectors that incorporate 
suitable RNA polymerase promoters such as the T7 or SP6 polymerase promoters. 
Alternatively, antisense cDNA constructs that synthesize antisense RNA constitutively or 
inducibly, depending on the promoter used, can be introduced stably into cell lines. 



30 5.6.3 GENE REPLACEMENT THERAPY 

With respect to an increase in the level of normal <x2M receptor gene expression 
and/or a2M receptor gene product activity, a2M receptor gene nucleic acid sequences can, 
for example, be utilized for the treatment of immune disorders resulting in proliferative 
disorders such as cancer. Such treatment can be administered, for example, in the form of 

35 gene replacement therapy. Specifically, one or more copies of a normal a2M receptor gene 
or a portion of the a2M receptor gene that directs the production of an a2M receptor gene 
product exhibiting normal <x2M receptor gene function, may be inserted into the appropriate 
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cells within a patient, using vectors that include, but are not limited to adenovirus, adeno- 
associated virus, and retrovirus vectors, in addition to other particles that introduce DNA into 
cells, such as liposomes. 

Gene replacement therapy techniques should be capable of delivering a2M receptor 

5 gene sequences to cell types that express the HSP receptor within patients. Thus, in one 
embodiment, techniques that are well known to those of skill in the art (see, e.g., PCT 
Publication No. WO89/10134, published April 25, 1988) can be used to enable a2M receptor 
gene sequences to be delivered to developing cells of the myeloid lineage, for example, to the 
bone marrow. In another specific embodiment, gene replacement can be accomplished using 

10 macrophages in vitro, and delivered to a patient using the techniques of adoptive 
immunotherapy. 

In another embodiment, techniques for delivery involve direct administration of such 
a2M receptor gene sequences to the site of the cells in which the a2M receptor gene 
sequences are to be expressed, e.g., directly at the site of the tumor. 

15 Additional methods that may be utilized to increase the overall level of a2M receptor 

gene expression and/or a2M receptor gene product activity include the introduction of 
appropriate a2M receptor-expressing cells, preferably autologous cells, into a patient at 
positions and in numbers that are sufficient to ameliorate the symptoms of an a2M receptor 
disorder. Such cells may be either recombinant or non-recombinant. 

20 Among the cells that can be administered to increase the overall level of a2M 

receptor gene expression in a patient are ceils that normally express the <x2M receptor gene. 

Alternatively, cells, preferably autologous cells, can be engineered to express a2M 
receptor gene sequences, and may then be introduced into a patient in positions appropriate 
for the amelioration of the symptoms of an a2M receptor disorder or a proliferative or viral 

25 disease, e.g., cancer and tumorigenesis. Alternately, cells that express an unimpaired a2M 
receptor gene and that are from a MHC matched individual can be utilized, and may include, 
for example, brain cells. The expression of the a2M receptor gene sequences is controlled by 
the appropriate gene regulatory sequences to allow such expression in the necessary cell 
types. Such gene regulatory sequences are well known to the skilled artisan. Such cell-based ^ 

30 gene therapy techniques are well known to those skilled in the art, see, e.g, Anderson, U.S. 
Patent No. 5,399,349. 

When the cells to be administered are non-autqlogous cells, they can be administered 
using well known techniques that prevent a host immune response against the introduced 
cells from developing. For example, the cells may be introduced in an encapsulated form 

35 which, while allowing for an exchange of components with the immediate extracellular 
environment, does not allow the introduced cells to be recognized by the host immune 
system. 
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5.6.4 DELIVERY OF SOLUBLE a2M RECEPTOR POLYPEPTIDES 
Genetically engineered cells that express soluble a2M receptor ECDs or fusion 

proteins, e.g., fusion Ig molecules can be administered in vivo where they may function as 
"bioreactors" that deliver a supply of the soluble molecules. Such soluble a2M receptor 
5 polypeptides and fusion proteins, when expressed at appropriate concentrations, should 
neutralize or "mop up" HSPs or other native ligand for the a2M receptor, and thus act as 
inhibitors of a2M receptor activity and may therefore be used to treat HSP-a2M receptor- 
related disorders and diseases, such as autoimmune disorders, proliferative disorders, and 
infectious diseases. 

10 

5.6.5 DELIVERY OF DOMINANT NEGATIVE MUTANTS 

In another embodiment of the invention, dominant negative mutants ("dominant 
negatives") may be used therapeutically to block the immune response to an HSP-antigen 
complex, e.g., to treat an auto-immune disorder. In general, such dominant-negatives are 

15 mutants which, when expressed, interact with ligand (i.e., HSP-antigenic molecule 
complex), but lack one or more functions, i.e. endocytotic functions and/or signaling 
functions, of normal <x2MR. Such mutants interfere with the function of normal <x2MR in the 
same cell or in a different cell, e.g. by titration of HSP-peptide complexes from the wild type 
receptor. Such a mutation, for example, can be one or more point mutation(s), a deletion, 

20 insertion, or other mutation in either the extracellular of the 515 kDa subunit, or the 

extracellular, transmembrane or intracellular domains of the 85 kDa subunit of the alpha(2) 
macroglobulin receptor (see Krieger and Herz, 1994, Annu. Rev. Biochem 63:601-637 for 
a2MR subunit configuration). However, in construction of dominant negative mutations in 
the either subunit, care should be taken to ensure that the cleavage domain (signaling 

25 cleavage between aas 3525 and 3526 of the precursor of a2MR) remains intact so that the 
515 kDa subunit is processed and presented on the cell surface. Additionally, care should be 
taken to ensure that the domains by which the two subunits associate should also remain 
functional. For example, in a specific embodiment, the C-terminal intracellular domain of 
the 85 kDa subunit is truncated. In another embodiment, a point mutation on the N-terminal 

30 515 kDa subunit blocks endocytosis but not ligand binding. In another embodiment, the N- 
terminal 515 kDa subunit is expressed as a fusion protein, wherein the C-terminus of said 
fusion protein is the transmembrane domain and optionally the intracellular domain, of 
another Type I single transmembrane receptor. 

Expression of a such a dominant negative mutation in cell can block uptake of ligand 

35 by normal functional receptors in the same or neighboring cells by titrating out the amount of 
available ligand. Thus, a recombinant antigen presenting cell expressing such a dominant 
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negative can be used to titrate out HSP-antigenic molecule complexes when administered to 
a patient in need of treatment for an autoimmune disorder. 

5.6.6 EXTRACORPOREAL METHODS FOR MODULATING THE IMMUNE 
RESPONSE 

The present invention also relates to methods for modulating an immune response in 
a patient by altering the levels a2M receptor ligand in the bloodstream using extracorporeal 
methods. a2M receptor acts as a heat shock protein receptor in <x2M receptor-expressing 
cells, such as macrophages and dendritic cells. Binding of HSPs or HSP antigenic peptide 
complexes to such <x2M receptor-expressing cells results in internalization of the HSP and 
the re-presentation of peptides chaperoned by the HSP. However, because <x2M receptor has 
a diverse roles in different cell types and binds numerous non-HSP ligands, competition 
between a2M receptor ligands reduces the ability of HSPs and HSP complexes to access 
a2M receptor. 

The Applicant has discovered that depleting the blood of non-HSP-a2M receptor 
ligands and transfusing such <x2M receptor-ligand-depleted blood into the bloodstream of a 
patient can be used to stimulate the immune response, perhaps by increasing access of HSP 
complexes to the a2M receptor. Alternatively, blood can be depleted of a2M receptor 
ligands, including HSPs, followed by the addition of HSPs or HSP antigenic peptide 
complexes to stimulate a specific immune response. Decreasing the levels of a2M receptor 
ligands can be used to enhance a desired immune response in patients, such as patients with 
cancer and infectious disease. Such methods for depletion of a2M receptor ligands to the 
bloodstream are described in detail below. 

In various embodiments, extracorporeal procedures, such as transfusion and 
apheresis, may be used to stimulate an immune response by modulating <x2M receptor ligand 
levels in a patient's circulation or alternatively, depleting a2M receptor ligands including 
HSPs from the blood, followed by the selective addition of specific HSPs or HSP antigenic 
peptide complexes to the blood. For example, in one embodiment, apheresis techniques 
coupled with affinity column technology, are used to remove a2M receptor ligand from a 
patients blood, followed by the return the ligand-depleted blood into circulation. 

In another embodiment, apheresis techniques coupled with affinity chromatography 
techniques are used to remove a2M receptor ligand from a patient's blood followed by the 
selective addition of HSPs or HSP antigenic peptide complexes to the patient's blood, and 
return of the treated blood into the patient's circulation. 

Extraction of blood can be performed either manually or by any one of the common 
automated, electronically controlled "apheresis" systems such as the Autopheresis-CRTM. 
system (Baxter Healthcare Corporation, Fenwal Division, 1425 Lake Cook Road, Deerfield, 
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111. 60015). In a preferred embodiment, a blood separation apparatus is fluidly connected to a 
blood vessel of the patient by way of a blood extraction tube. A blood pump, such as a 
peristaltic pump, is positioned on the blood extraction tube to pump blood from the patient to 
a blood separation apparatus. An anticoagulant, such as heparin, can be added to the blood 

5 through a separate chamber that is in fluid communication with the apheresis system. 

Optionally, blood can be taken out of the apheresis system, treated to remove a a2M 
receptor ligand in the laboratory, and then put back into the apheresis system to be 
reintroduced to the patient. In another embodiment, the blood can be further separated into 
cellular components such that only a specific subset of cells (i.e. leukocytes) can be treated to 

IQ remove an a2M receptor ligand and returned to the patient or, alternatively, only the plasma 
can be treated to remove an <x2M receptor ligand and returned to the patient. In another 
embodiment, after the blood has been treated to remove an <x2M receptor ligand, HSPs are 
added back to the blood. 

In various embodiments, blood from a patient can be withdrawn manually and the 

25 cells can be separated by a standard laboratory blood cell collection device. After or during 
the cellular collection, the blood can be treated to remove an a2M receptor ligand. The cells 
can then be returned to the patient by an i.v. drip or by injection with a syringe. 

In one embodiment, transfusion/apheresis methods may be used to enhance an 
immune response, <x2M receptor ligands are removed from transfused blood of a patient in 

20 need of treatment for an immune disorder. In another embodiment, the a2M receptor ligand 
that is removed from the blood is not a heat shock protein. 

One example of such a method comprises the following steps: (1) withdrawing blood 
from a patient; (2) passing the patient's blood over an affinity column comprising a a2M 
receptor ligand-binding compound, such as an antibody specific for a <x2M receptor ligand, 

25 for a time period and under conditions sufficient to allow binding of a2M receptor ligand to 
the affinity column; (3) returning the a2M receptor-ligand depleted blood to the patient. 

In another embodiment, apheresis methods may be used to enhance an immune 
response by depleting a2M receptor ligands (including HSPs) followed by the addition of 
selective HSPs or HSP antigenic peptide complexes to the blood of a patient. 

30 An example of such a method comprises the following steps: (1) withdrawing blood 

from a patient; (2) passing the patient's blood over an affinity column comprising a a2M 
receptor-ligand-binding compound for a time period and under conditions sufficient to allow 
binding of the a2M receptor ligand to the affinity column; (3) adding HSPs or HSP antigenic 
peptide complexes to the ligand depleted blood; (4) returning the blood to the patient. 

35 Methods that can be used to remove a ligand from the blood include affinity 

chromatography, anion or cation exchange chromatography, phosphocellulose 
chromatography, immunoaffinity chromatography, hydroxyapatite chromatography, and 
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lectin chromatography. Affinity purification is based on the interaction between the 
compound on the affinity column and its binding partner. The principle of affinity 
chromatography is well known in the art. In one embodiment, a recombinantly expressed and 
purified (or partially purified) protein, such as a2M receptor, is covalently or non-covalently 

5 coupled to a solid support such as, for example, a chromatography column. The extracted 
blood from a patient can be run over such a column. The coupled protein will bind to the 
a2M receptor ligand and deplete the blood of the a2M receptor ligand. The depleted blood 
can then be returned to the patient. In another embodiment, an antibody specific to the 
ligand can be coupled to the chromatography column and the immunospecific binding of an 

10 antibody to the a2M receptor ligand can be used to deplete the blood of the <x2M receptor 
ligand. Alternatively, one of the many cation or anion exchange resins commonly used in 
the art can be used to deplete the blood of the a2M receptor ligand. 

In another embodiment, the present invention also includes a kit that comprises a 
solid phase chromatography column with a purified a2M receptor ligand binding molecule 

15 attached thereto. Such a kit can contain components necessary for extracorporeal removal of 
a2M receptor ligands from the blood of a patient in need of such treatment. 

Transfusion/apheresis methods may also be used in combination with other methods 
of immunotherapy. In one embodiment, for example, after depletion of non-HSP a2M 
receptor ligands as described above, HSP-antigenic peptide complexes may be delivered to a 

20 cancer patient, or a patient having an infectious disease, using the transfusion/apheresis 
methods, or other method. Using transfusion/apheresis, at the same time as HSP-antigenic 
peptide complexes are being delivered, a2M receptor ligands (other than HSPs) may be 
removed from the patient's blood, in order to stimulate the immune response against the 
HSP-antigenic peptide complex being delivered. Thus, the fransfusion/apheresis method 

25 makes it possible to accomplish both the delivery of HSP-antigenic peptide complexes and 
the removal of competing a2M receptor ligands in a single procedure. 

5.7 TARGET AUTOIMMUNE DISEASES 

30 Autoimmune diseases that can be treated by the methods of the present invention 

include, but are not limited to, insulin dependent diabetes mellitus (i.e., EDDM, or 
autoimmune diabetes), multiple sclerosis, systemic lupus erythematosus, Sjogren's 
syndrome, scleroderma, polymyositis, chronic active hepatitis, mixed connective tissue 
disease, primary biliary cirrhosis, pernicious anemia, autoimmune thyroiditis, idiopathic 

35 Addison's disease, vitiligo, gluten-sensitive enteropathy, Graves' disease, myasthenia gravis, 
autoimmune neutropenia, idiopathic thrombocytopenia purpura, rheumatoid arthritis, 
cirrhosis, pemphigus vulgaris, autoimmune infertility, Goodpasture's disease, bullous 
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pemphigoid, discoid lupus, ulcerative colitis, and dense deposit disease. The diseases set 
forth above, as referred to herein, include those exhibited by animal models for such 
diseases, such as, for example non-obese diabetic (NOD) mice for IDDM and experimental 
autoimmune encephalomyelitis (BAE) mice for multiple sclerosis. 
5 The methods of the present invention can be used to treat such autoimmune diseases 

by reducing or eliminating the immune response to the patients own (self) tissue, or, 
alternatively, by reducing or eliminating a pre-existing autoimmune response directed at 
tissues or organs transplanted to replace self tissues or organs damaged by the autoimmune 
response. 

10 

5.8 TARGET INFECTIOUS DISEASES 

The infectious diseases that can be treated or prevented using the methods and 
compositions of the present invention include those caused by intracellular pathogens such as 
viruses, bacteria, protozoans, and intracellular parasites. Viruses include, but are not limited 

j 5 to viral diseases such as those caused by hepatitis type B virus, parvoviruses, such as 
adeno-associated virus and cytomegalovirus, papovaviruses such as papilloma virus, 
polyoma viruses, and SV40, adenoviruses, herpes viruses such as herpes simplex type I 
(HSV-I), herpes simplex type II (HSV-D), and Epstein-Barr virus, poxviruses, such as 
variola (smallpox) and vaccinia virus, RNA viruses, including but not limited to human 

20 immunodeficiency virus type I (HTV-I), human immunodeficiency virus type II (HIV -II), 
human T-cell lymphotropic virus type I (HTLV-I), and human T-cell lymphotropic virus 
type II (HTLV-II); influenza virus, measles virus, rabies virus, Sendai virus, picornaviruses 
such as poliomyelitis virus, coxsackieviruses, rhinoviruses, reoviruses, togaviruses such as 
rubella virus (German measles) and Semliki forest virus, arboviruses, and hepatitis type A 

25 virus. 

In another embodiment, bacterial infections can be treated or prevented such as, but 
not limited to disorders caused by pathogenic bacteria including, but not limited to, 
Streptococcus pyogenes, Streptococcus pneumoniae, Neisseria gonorrhoea, Neisseria 
meningitidis, Corynebacterium diphtheriae, Clostridium botulinum, Clostridium perfringens, 

30 Clostridium tetani, Haemophilus influenzae, Klebsiella pneumoniae, Klebsiella ozaenae, 
Klebsiella rhinoscleromotis, Staphylococcus aureus, Vibno cholerae, Escherichia coli, 
Pseudomonas aeruginosa, Campylobacter (Vibrio) fetus , Campylobacter jejuni, Aeromonas 
hydrophila, Bacillus cereus, Edwardsiella tarda, Yersinia enterocolitica, Yersinia pestis, 
Yersinia. pseudotuberculosis, Shigella dysenteriae, Shigella flexneri, Shigella sonnei, 

35 Salmonella typhiimurium, Salmonella typhii, Treponema pallidum, Treponema pertenue, 
Treponema carateneum, Borrelia vincentii, Borrelia burgdorferi, Leptospira 
icterohemorrhagiae, Mycobacterium tuberculosis, Toxoplasma gondii, Pneumocystis carinii, 
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Francisella tularensis, Brucella abortus, Brucella suis, Brucella melitensis, Mycoplasma 
spp., Rickettsia prowazeki, Rickettsia tsutsugumushi, Chlamydia spp., and Helicobacter 
pylori. 

In another preferred embodiment, the methods can be used to treat or prevent 
5 infections caused by pathogenic protozoans such as, but not limited to, Entomoeba 

histolytica, Trichomonas tenas, Trichomonas hominis, Trichomonas vaginalis, Trypanosoma 
gambiense, Trypanosoma rhodesiense, Trypanosoma cruzi, Leishmania donovani, 
Leishmania tropica, Leishmania braziliensis, Pneumocystis pneumonia, Plasmodium vivax, 
Plasmodium falciparum, and Plasmodium malaria. 

10 

5.9 TARGET PROLIFERATIVE CELL DISORDERS 

With respect to specific proliferative and oncogenic disease associated with HSP- 
a2M receptor activity, the diseases that can be treated or prevented by the methods of the 
present invention include, but are not limited to: human sarcomas and carcinomas, e.g., 
15 fibrosarcoma, myxosarcoma, liposarcoma, chondrosarcoma, osteogenic sarcoma, chordoma, 
angiosarcoma, endotheliosarcoma, lymphangiosarcoma, lymphangioendotheliosarcoma, 
synovioma, mesothelioma, Ewing's tumor, leiomyosarcoma, rhabdomyosarcoma, colon 
carcinoma, pancreatic cancer, breast cancer, ovarian cancer, prostate cancer, squamous cell 
carcinoma, basal cell carcinoma, adenocarcinoma, sweat gland carcinoma, sebaceous gland 
20 carcinoma, papillary carcinoma, papillary adenocarcinomas, cystadenocarcinoma, medullary 
carcinoma, bronchogenic carcinoma, renal cell carcinoma, hepatoma, bile duct carcinoma, 
choriocarcinoma, seminoma, embryonal carcinoma, Wilms' tumor, cervical cancer, testicular 
tumor, lung carcinoma, small cell lung carcinoma, bladder carcinoma, epithelial carcinoma, 
glioma, astrocytoma, medulloblastoma, craniopharyngioma, ependymoma, pinealoma, 
25 hemangioblastoma, acoustic neuroma, oligodendroglioma, meningioma, melanoma, 
neuroblastoma, retinoblastoma; leukemias, e.g., acute lymphocytic leukemia and acute 
myelocytic leukemia (myeloblastic, promyelocyte, myelomonocytic, monocytic and 
erythroleukemia); chronic leukemia (chronic myelocytic (granulocytic) leukemia and chronic 
lymphocytic leukemia); and polycythemia vera, lymphoma (Hodgkin's disease and non- 
30 Hodgkirfs disease), multiple myeloma, Waldenstrom's macroglobulinemia, and heavy chain 
disease. 

Diseases and disorders involving a deficiency in cell proliferation or in which cell 
proliferation is desired for treatment or prevention, and that can be treated or prevented by 
inhibiting the a2M receptor function, include but are not limited to degenerative disorders, 
35 growth deficiencies, hypoproliferative disorders, physical trauma, lesions, and wounds; for 
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example, to promote wound healing, or to promote regeneration in degenerated, lesioned or 
injured tissues, etc. 

5.10 PHARMACEUTICAL PREPARATIONS AND METHODS OF 
ADMINISTRATION 

5 

The compounds that are determined to affect a2M receptor gene expression or gene 
product activity can be administered to a patient at therapeutically effective doses to treat or 
ameliorate a cell proliferative disorder. A therapeutically effective dose refers to that amount 
of the compound sufficient to result in amelioration of symptoms of such a disorder. 

10 

5.10.1 EFFECTIVE DOSE 

Toxicity and therapeutic efficacy of such compounds can be determined by standard 
pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the 
LD 50 (the dose lethal to 50% of the population) and the ED 50 (the dose therapeutically 

1 5 effective in 50% of the population). The dose ratio between toxic and therapeutic effects is 
the therapeutic index and it can be expressed as the ratio LD 50 /ED 50 . Compounds that exhibit 
large therapeutic indices are preferred. While compounds that exhibit toxic side effects may 
be used, care should be taken to design a delivery system that targets such compounds to the 
site of affected tissue in order to minimize potential damage to uninfected cells and, thereby, 

2Q reduce side effects. 

The data obtained from the cell culture assays and animal studies can be used in 
formulating a range of dosage for use in humans. The dosage of such compounds lies 
preferably within a range of circulating concentrations that include the ED 50 with little or no 
toxicity. The dosage may vary within this range depending upon the dosage form employed 

25 and the route of administration utilized. For any compound used in the method of the 
invention, the therapeutically effective dose can be estimated initially from cell culture 
assays. A dose may be formulated in animal models to achieve a circulating plasma 
concentration range that includes the IC 50 (i.e., the concentration of the test compound that 
achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such 

30 information can be used to more accurately determine useful doses in humans. Levels in 
plasma may be measured, for example, by high performance liquid chromatography. 



35 
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5.10.2 FORMULATIONS AND USE 

Pharmaceutical compositions for use in accordance with the present invention may be 
formulated in conventional manner using one or more physiologically acceptable carriers or 
excipients. 

5 Thus, the compounds and their physiologically acceptable salts and solvates may be 

formulated for administration by inhalation or insufflation (either through the mouth or the 
nose) or oral, buccal, parenteral or rectal administration. 

For oral administration, the pharmaceutical compositions may take the form of, for 
example, tablets or capsules prepared by conventional means with pharmaceutically 

JO acceptable excipients such as binding agents (e.g., pregelatinised maize starch, 
polyvinylpyrrolidone or hydroxypropyl methylcellulose); fillers (e.g., lactose, 
microcrystalline cellulose or calcium hydrogen phosphate); lubricants (e.g, magnesium 
stearate, talc or silica); disintegrants (e.g, potato starch or sodium starch glycolate); or 
wetting agents (e.g., sodium lauryl sulphate). The tablets may be coated by methods well 

15 known in the art. Liquid preparations for oral administration may take the form of, for 
example, solutions, syrups or suspensions, or they may be presented as a dry product for 
constitution with water or other suitable vehicle before use. Such liquid preparations may be 
prepared by conventional means with pharmaceutically acceptable additives such as 
suspending agents (e.g., sorbitol syrup, cellulose derivatives or hydrogenated edible fats); 

20 emulsifying agents (e.g., lecithin or acacia); non-aqueous vehicles (e.g., almond oil, oily 
esters, ethyl alcohol or fractionated vegetable oils); and preservatives (e.g, methyl or propyl- 
p-hydroxybenzoates or sorbic acid). The preparations may also contain buffer salts, 
flavoring, coloring and sweetening agents as appropriate. 

Preparations for oral administration may be suitably formulated to give controlled 

25 release of the active compound. 

For buccal administration the compositions may take the form of tablets or lozenges 
formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present 
invention are conveniently delivered in the form of an aerosol spray presentation from 

30 pressurized packs or a nebuliser, with the use of a suitable propellant, e.g., 

dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide 
or other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined 
by providing a valve to deliver a metered amount. Capsules and cartridges of e.g., gelatin for 
use in an inhaler or insufflator may be formulated containing a powder mix of the compound 

35 and a suitable powder base such as lactose or starch. 

The compounds may be formulated for parenteral administration by injection, e.g., by 
bolus injection or continuous infusion. Formulations for injection may be presented in unit 
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dosage form, e.g., in ampoules or in multi-dose containers, with an added preservative. The 
compositions may take such forms as suspensions, solutions or emulsions in oily or aqueous 
vehicles, and may contain formulatory agents such as suspending, stabilizing and/or 
dispersing agents. Alternatively, the active ingredient may be in powder form for 
5 constitution with a suitable vehicle, e.g. , sterile pyrogen-free water, before use. 

The compounds may also be formulated in rectal compositions such as suppositories 
or retention enemas, e.g., containing conventional suppository bases such as cocoa butter or 
other glycerides. 

In addition to the formulations described previously, the compounds may also be 
IQ formulated as a depot preparation. Such long acting formulations may be administered by 
implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 
Thus, for example, the compounds may be formulated with suitable polymeric or 
hydrophobic materials (for example as an emulsion in an acceptable oil) or ion exchange 
resins, or as sparingly soluble derivatives, for example, as a sparingly soluble salt. 
15 The compositions may, if desired, be presented in a pack or dispenser device that may 

contain one or more unit dosage forms containing the active ingredient. The pack may for 
example comprise metal or plastic foil, such as a blister pack. The pack or dispenser device 
may be accompanied by instructions for administration. 



20 

6. EXAMPLE: IDENTIFICATION OF a2M RECEPTOR AS AN HSP RECEPTOR 

6.1 INTRODUCTION 

The Example presented herein describes the successful identification of an interaction 
25 between gp96, hsp90, hsp70, and calreticulin with the <x2M receptor present in macrophages 
and dendritic cells. The experiments presented herein form the basis for isolating a2M 
receptor polypeptides and for the screening, diagnostic, and therapeutic methods of the 
present invention. 

The Applicant of the present invention noted that certain observations were 
30 inconsistent with a "direct transfer" model of HSP-chaperoned peptide antigen presentation. 
First, the immunogenicity of HSP preparations is dependent on the presence of functional 
phagocytic cells but not B cells or other nonprofessional antigen-presenting cells, (Udono 
and Srivastava, 1993, supra; Suto and Srivastava, 1995, supra), whereas free peptides can 
sensitize all cell types. Second, extremely small quantities of HSP-peptide complexes were 
35 effective in eliciting specific immunity, i.e., gp96-chaperoned peptides are several hundred 
times as effective as free peptides in sensitizing macrophages for CTL recognition, 
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suggesting the possibility of a specific uptake mechanism. Third, gp96-chaperoned peptides 
elicited an MHC I response that was not limited by the size of peptide. Finally, the 
processing of gp96-peptide complexes in macrophage was found to be sensitive to Brefeldin 
A (BFA), which blocks transport through the Golgi apparatus, suggesting that processing 

5 occurred through an intercellular mechanism. These observations led to the hypothesis that 
HSP-chaperoned peptides may be processed internally and re-presented by MHC class I 
molecules on the cell surfaces of macrophages (Suto and Srivastava, 1995, supra). There is 
also the hypothesis that the mannose receptor is used in the uptake of gp96 but no 
mechanism has been proposed for the non-glycosylated HSPs, such as HSP70 (Ciupitu et al. 9 

IQ 1998, J. Exp. Med., 187: 685-691). Others suggested that a novel intracellular trafficking 
pathway may be involved for the transport of peptides from the extracellular medium into the 
lumen of ER (Day et aL, 1997, Proc. Natl. Acad. Sci. 94:8065-8069; Nicchitta, 1998, Curr. 
Opin. in Immunol 10:103-109). Further suggestions include the involvement of phagocytes 
which (a) possess an ill-defined pathway to shunt protein from the phagosome into the 

15 cytosol where it would enter the normal class I pathway; (b) digest ingested material in 
lysosomes and regurgitate peptides for loading on the surface to class I molecules (Bevan, 
1995, J. Exp. Med. 192:639-41). The discovery of a receptor for heat shock proteins as 
disclosed herein helps to resolve the paradox of how extracellular antigenic peptides 
complexed to HSPs can be presented by MHC class I molecules on antigen presenting cells. 

20 

6.2 MATERIALS AND METHODS 

Mice, cells, and reagents. C57B1/6, BALB/c and TAP(-/-) mice were obtained from 
Jackson laboratories. Bone marrow-derived DCs were generated from the femurs and tibia of 
C57BL/6 mice. The bone marrow was flushed out and the leukocytes obtained and cultured 

25 as described (Lutz et a/., 1999, J. Immunol. Methods 223:77-92) in complete RPMI1640 with 
10% heat inactivated FCS and 20ng/ml GMCSF (Endogen Inc., Woburn, MA) for 6 days. 
On day 3 fresh media with GMCSF was added to the plates for the day 6 cultures. 
Macrophages were obtained from PEMs of pristaned mice by positive selection for CD1 lb+ 
cells (Miltenyi Biotech Inc.). RAW264.7 was gift of Dr. Christopher Nicchitta. A20.25 was 

30 gift of Dr. Lawrence Kwak. All other cell lines were obtained from ATCC. Proteasome 
inhibitor Lactacystin was purchased from Kamiya Inc. Japan. Anti-CD91 antibody (clone 
5 A6) was purchased from PRAGEN (Heidelberg). Anti-hsp70 (clone N27F3) and anti-PDI 
(clone 1D3) antibodies were purchased from StressGen (Victoria, Canada). 

Purification of HSPs. HSPs were purified as described (Srivastava, P.K., 1997, 

35 Methods: A companion to Methods in Enzymology 12:165-171; Basu and Srivastava, 1999, 
J. Exp. Med. 189(5):797-802). All buffers used for purifications were prepared with 
endotoxin free water (Nanopure Infinity UV/UF, Barnstead/Thermolyne, Dubuque, IA) and 
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all glasswares used for purification were cleaned with endotoxin free water and baked in a 
4000F oven (Gruenberg, WUiamsport, PA). The HSP-containing fractions were identified by 
immunoblots. 

Conjugation of proteins to FITC and staining of cells. Purified proteins were 
5 conjugated to FITC using the FluoroTag FITC conjugation kits (SIGMA) as per the 

manufacturers protocol. Conjugation was confirmed by a 2kDa increase in molecular weight 
by SDS-PAGE and by immunoblotting with an anti-FITC monoclonal antibody. Incubations 
. of indicated amounts of FITC-tagged proteins and cells were done in the presence of 1% 
nonfat dry milk (Carnation®) in PBS for 20min at 4°C. After repeated washing, cells were 
10 analyzed by flow cytometry (Becton Dickenson, La Jolla, Califronia). Cells were also labeled 
with propidium iodide just before FACScan analysis. Cells staining positive for propidium 
iodide were gated out of the events. No differences were observed in the binding of HSPs to 
Mac-1+ cells from pristaned or non-pristaned mice. Fixed or unfixed cells were labeled with 
FITC-tagged HSP as above. Labeled cells were visualized using a Zeiss LSM confocal 
15 microscope. 

Affinity chromatography. Proteins (lmg) in 2ml volume were incubated with 2ml of 
equilibrated AminoLink beads in PBS with a reductant (NaCNBH 3 ) for 1 hour. Uncoupled 
protein was removed by extensive washing of the column and unreactive groups quenched. 
Immobilization yields were typically >92% of the starting amount of protein. Columns were 

2Q stored at 4°C until used. Such columns were made with gp96 (purified as described in 
Srivastava et al., 1986, Proc. Natl. Acad. Sci., U.S.A. 83:3407-341 1) and albumin. For 
membrane purification, cells were lysed by dounce homogenization in hypotonic buffer 
containing PMSF. Unlyzed cells and nuclei were removed by centrifiigation at lOOOg for 5 
mm. The postauclear supernatant was centrifuged at 100,000g for 90 mins. The pellet 

25 contains total membranes and was fractionated by aqueous two-phase partition with a 
dextran/polyethylene glycol biphase. Briefly membranes were resuspended in PEG (33% 
wt/wt in 0.22 M sodium phosphate buffer, pH 6.5) and underlaid gently with dextran 
(20%wt/wt in 0.22M sodium phosphate buffer, pH 6.5). The two phases were mixed gently 
and centrifuged at 2000 g for 15 mins. The white material at the interphase was enriched for 

3Q plasma membranes, whose proteins were extracted by 2 hr incubation in 20mM Tris buffer 
(pH8, containing 0.08% octylglucoside) at 4°C. 

Photo cross-linking of gp96 to putative receptor. The cross-linker (S ASD, (Pierce) 
was labeled with I 125 using iodobeads (Pierce). Radiolabeled SASD was covalently attached 
to gp96 by incubation at room temperature for 1 hr. Free SASD and I 125 were removed by 

35 size exclusion column (KwikSep columns, Pierce). For cross-linking studies, I 125 -SASD- 
gp96 (50 |ig gp96) was incubated with purified CD1 lb + cells. Unbound protein was 
removed by washing. All procedures to this point were carried out in very dim light. 
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Proteins were cross-linked with UV light Cells were lysed with lysis buffer (0,5%NP4O, 
lOmM Tris, ImMEDTA, 150mM NaCl) and treated with 100 mM 2-mercaptoethanol to 
cleave the cross-linker. Cell ly sates were analyzed by SDS-PAGE and autoradiography. 
Re-presentation assays. Re-presentation assays were carried out as described (Suto 

5 and Srivastava, 1995, Science 269:1585-1588). Antigen presenting cells (RAW264.7 
macrophage cell line) were plated at a 1:1 ratio with AH I -specific T cells in complete 
RPML Approximately 10,000 cells of each type were used. Gp96 (10 ng/ml) chaperoning 
the AH1-20 mer peptide (RVTYHSPSYVYHQFERRAK) was added to the cells and the 
entire culture was incubated for 20 hrs. Stimulation of T cells was measured by quantifying 

10 the amount of IFN-Y released into the supematants by ELIS A (Endogen). In addition, 
CD1 lb+ peritoneal exudate cells (1X104) were pulsed with HSPs purified from liver, or 
HSP-peptide complex generated in vitro and relevant CD8+ T cells (VSV8 specific CTL line 
or AHl-specific CTL clones, as indicated) were added to the cultures. The assay was carried 
out in 250 ml volume in 96-well plates with RPMI medium containing 5% FCS at 370C for 

15 20 hours. Culture supematants were harvested and tested for the presence of IFN-y release by 
ELISA (Endogen Inc., Woburn, MA). 

Complexing in vitro of peptide to HSPs. HSPs were mixed with VSV19 or AH1-19 
in a 50 : 1 peptide to protein molar ratio in 0.7M NaCl in Na - phosphate buffer and heated at 
500 C for 10 min., then incubated at room temperature for 30 min. Excess free peptide was 

2o removed with PBS using centricon 10 ( Amicon, Inc., Beverly MA). 

Purification of CDllb+ cells. CD1 lb+ cells were selected using the MACS columns 
and protocols supplied by Miltenyi Biotec Inc. Auburn, California. CD1 lb antibody, 
supplied as CD1 lb MicroBeads, was purchased from Miltenyi Biotec Inc., and has been 
demonstrated not to activate CD1 lb+ cells with regard to the markers tested in this 

25 experiment. 

Induction of cytotoxic T cells. C57BL/6 mice were immunized intraperitoneally with 
50 mg of gp96 complexed with VSV19 peptide. Ten days later, recipient spleens were 
removed and splenocytes were stimulated with VSV8 synthetic peptide at ImM 
concentration. After 5 days, MLTCs were tested for cytotoxicity in a chromium release assay 

30 using EL4 cells alone and EL4 cells pulsed with VSV8 peptide as targets. 

Protein Microsequencing. Proteins identified by affinity chromatography were 
analyzed on SDS-PAGE and stained with coomasie blue or transferred onto PVDF 
membrane and stained with coomasie blue, all of it under keratin-free conditions. Protein 
bands were excised with a razor from the gel or membrane. Tryptic peptides from an 80kDa 

35 faint coomassie band were extracted by 50% acetonitrile, 5% formic acid, dried, and loaded 
onto a 75 m 10 cm, reverse-phase CI 8, microcapillary column (3 jxl vol) and tryptic peptides 
were separated by on-line microcapillary liquid chromatography-tendem mass spectrometry 
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followed by database searching using the SEQUEST program as previously described. 
(Gatlin et a/.,2000, Anal. Chem. 72:757-63; Link et aL, 1999, Nat. Biotechnol. 17:676-82). 
The analysis was carried out in a data-dependent auto-MS/MS fashion using a Finnigan LCQ 
iontrap Mass Spectrometer. 



6.3 RESULTS 

Identification of an 80 kDa protein as a potential gp96 receptor. Homogenous 
preparations of gp96 were coupled to FITC and the gp96-FITC was used to stain RAW264.7 
cells, shown to be functionally capable of re-presenting gp96-chaperoned peptides. Gp96- 

10 FITC but not control albumin-FITC preparations stained the cell surface of RAW264.7 cells 
(FIG. 1 A). Plasma membrane preparations of cell surface-biotinylated RAW264.7 cells were 
solubilized in 0.08% octyl-glucoside and the soluble extract was applied to a gp96-Sepharose 
column. The bound proteins were eluted with 3M sodium chloride. SDS-PAGE analysis of 
the eluate showed 2 major bands of -75-80 kDa size (FIG. IB, top left). Blotting of this gel 

1 5 with avidin-peroxidase showed that both bands were biotinylated, indicating their surface 
localization (FIG. IB, bottom left). Affinity purification of membrane extracts of RAW264.7 
cells over control serum albumin affinity columns did not result in isolation of any proteins, 
nor did probing of immunoblots of such gels with avidin peroxidase detect any albumin- 
binding surface proteins (FIG. IB, top and bottom center lanes). As an additional control, 

20 chromatography of membrane extracts of P815 cells which do not bind gp96-FITC and 

which do not re-present gp96-chaperoned peptides, on gp96 affinity columns did not result in 
elution of any gp96-binding proteins (FIG. IB, top and bottom right lanes). 

In parallel experiments, gp96 molecules were coupled to the radio-iodinated linker 
sulfosuccinimidyl (4-azidosalicylamido) hexanoate (S ASD) which contains a photo cross- 

25 linkable group. Gp96-S ASD-I 125 was pulsed onto peritoneal macrophages, which have been 
shown previously to re-present gp96-chaperoned peptides (Suto and Srivastava, 1995, 
Science 269:1585-1588). Excess gp96-SASD was removed by multiple rounds of washing 
of the cells and photoactivation was carried out by exposure of cells to UV light for 10 min. 
Cell lysates were reduced in order to transfer the I 125 group to the putative gp96 ligand and 

30 were analyzed by SDS- PAGE followed by autoradiography. The gp96 molecule was 
observed to cross-link to an -80 kDa band specifically present in re-presentation-competent 
macrophage but not in the re-presentation-incompetent P815 cells (FIG. 1C). This band 
appears to correspond in size to the larger of the two bands seen in eluates of gp96 affinity 
columns (FIG. IB). No band corresponding to the lower band in that preparation is seen in 

35 the photo cross-linked preparation. The observation of a specific binding of gp96to an 80 
kDa protein in two different re-presentation-competent cell types, but not in a re- 
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presentation-incompetent cell line, and by two independent assays supported the candidacy 
of the 80 kDa molecule for the gp96 receptor. 

Antiserum against the 80 kDa protein inhibits re-presentation of a gp96-chaperoned 
antigenic peptide. The eluates containing the 75-80 kDa proteins were used to immunize a 

5 New Zealand white rabbit, and pre-immune and immune sera were used to probe blots of 
plasma membrane extracts of the re-presentation-competent RAW264.7 and primary 
peritoneal macrophages and the re-presentation-incompetent P815 cells. The immune but not 
the pre-immune serum detected the 80 kDa band (and a faint lower 75 kDa band) in plasma 
membrane extracts of primary macrophage and the RAW264.7 membranes but not of P815 
cells (FIG. 2A). The pre-immune and immune sera were tested in a functional assay for their 
ability to block re-presentation of gp96-chaperoned peptides. The L d -restricted epitope AH1 
derived from the gp70 antigen of murine colon carcinoma CT26 (Huang et al. 9 1996, Proc. 
Natl. Acad. Sci. U.S.A. 93:9730-9735) was used as the model system. Complexes of gp96 
with an AH1 precursor (used to inhibit direct presentation) were pulsed onto RAW264.7 

15 cells which were used to stimulate a L d /AHl-specific CD8+ T cell clone. Release of 
interferon-^ by the clones was measured as a marker of their activation. RAW264.7 cells 
were able to re-present gp96-chaperoned AH1 precursor effectively in this assay. It was 
observed that at the highest concentration, the immune sera inhibited re-presentation 
completely (FIG. 2B). Although the pre-immune serum was ineffective in inhibiting re- 

20 presentation as compared to the immune sera, it did inhibit re-presentation significantly at 
higher concentrations. The significance of this observation became clear later when we 
determined the identity of the gp96 receptor. Repeated immunizations with the affinity- 
purified gp96-binding proteins did not result in corresponding increase in antibody titers. 

Identification of the 80 kDa protein as an amino terminal fragment of the heavy chain 

25 of the a2M receptor. The 80 kDa protein eluted from the gp96 affinity column was resolved 
on SDS-PAGE and visualized by staining with Coomassie Brilliant Blue. The protein band 
was subjected to in-gel trypsin digestion and mass spectrometry-based protein 
microsequencing as described in the methods in Section 6.2. Four independent tryptic 
peptides corresponding to N-terminal region of the a 2-macroglobulin (a2M) receptor, 

30 designated by immunologists as CD91, were identified (FIG. 3C). 

a2M inhibits re-presentation of a gp96-chaperoned antigenic peptide by RA W264. 7. 
a2M receptor is one of the known natural ligands for the <x2M receptor. Its ability to inhibit 
re-presentation of gp96-chaperoned antigenic peptide AH1 was tested in the assay described 
in FIG. 2. a2M but not control proteins selectin (CD62) or serum albumin was observed to 

35 inhibit re-presentation completely and titratably (FIG. 4). This observation was also 
consistent with the result in FIG. 2 that while the pre-immune serum did not detect an 80 
kDa band in plasma membranes of RAW264.7 cells, it did inhibit re-presentation to some 
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degree at high concentrations. Thus, by structural as well as functional criteria, the a2M 
receptor was determined to fulfill the criteria essential for a receptor for gp96. 

Binding of fluorescence-labeled HSPs and c^macroglobulin to a panel of primary 
and cultured cells. FITC-labeled HSPs, gp96, hsp90 or hsp70, or control non-HSP serum 

5 albumin (S A) were incubated with primary cells such as pristane-induced peritoneal 
macrophage, differentiated bone marrow-derived dendritic cells or with immortalized cell 
lines such as RAW264.7, RAW309Cr.l of macrophage origin, P815 mastocytoma, YAC-1 
lymphoma, EL4 thymoma, Meth A and PS-C3H fibrosarcomas, B16 melanoma, CT26 colon 
carcinoma, and UV6139 squamous celt carcinoma, as described in the Methods. After 

10 removal of unbound protein by extensive washing, cells were analyzed by flow cytometry. 
As shown in Figure 5, the peritoneal macrophages and the bone marrow-derived dendritic 
cells showed robust binding of each of the three HSPs but not albumin. However, of the two 
macrophage cell lines, only one of them, RAW264.7, bound the three HSPs. 
RAW309Cr.l.did not bind any of the HSPs (FIG. 6A and 6B). Out of 8 other cell lines tested 

1 5 with the FITC-labeled gp96, hsp90 and hsp70, none was observed to bind to HSP in a 
manner comparable to the binding observed with RAW264.7. YAC 1 was observed to bind 
hsp70 but only to a significantly smaller degree. The binding was only a fraction of that 
observed with APCs. 



As described above, the a2 macroglobulin receptor has been identified as the receptor 



20 for gp96. All of the cell types in Figure 5 were also tested for the presence of CD91 by 
staining with FITC-a2 macroglobulin. CD91 showed precisely the same pattern of 
distribution as did each of the three HSPs (FIG. 5), 

The ability of cells to bind HSPs and ajvl correlates with the ability to represent 
gp96-chaperoned peptides. We tested if the ability of a particular cell type to bind HSPs or 

25 macroglobulin as shown in Figure 5 correlates with its ability to re-present gp96- 

chaperoned peptides. Re-presentation studies are done typically by incubating APCs and an 
HSP, chaperoning a known peptide, with T cells specific for an epitope present in the 
chaperoned peptide (Suto and Srivastava,1995, supra). The experimental system is set up 
such that the peptide cannot charge directly onto MHC I but requires intracellular processing 

30 followed by presentation to T cells. VSV8 and AH1 antigenic systems were used in these 
studies. The VSV8 epitope (RGYVYQGL) is presented by the K b molecule and VSV19 
(SLSDL RGYVYQGLKSGNVS) is its extended variant, which cannot charge K b directly. 
AH1 (SPSYVYHQF) is an L d -restricted epitope of a murine leukaemia virus envelope 
protein gp70 (Huang et al.,1996), and AH1-19 (RVTYHSPSYVYHQEERRAK) is its 

35 extended version. Peritoneal macrophage and BM-DCs were tested side-by-side for re- 
presentation in the VSV8 system, and both cell types were able to re-present gp96- 
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chaperoned VSV19 to VSV8-specific T cells (FIG. 7A). EL4 and B16 cells, both of the b 
haplotype, were also tested and were found unable to re-present in identical assays (data not 
shown). The BM-DCs were observed to re-present gp96-chaperoned VSV19 significantly 
better than macrophage did; however, it is not possible to determine from the data if this 

5 difference derives from the better T cell stimulatory properties of DCs in general or whether 
the DCs are specifically more efficient than macrophage at re-presenting gp96-chaperoned 
peptides. The two macrophage cell lines RAW309Cr.l and RAW264.7 were tested for their 
re-presentation ability in the AH1 system. In parallel with the HSP and a2M-staining data 
(FIG. 5), RAW264.7 cells but not RAW309Cr.l were observed to be capable of re- 

10 presenting gp96-chaperoned AH1 peptides (Fig. 7B). 

Peptides chaperoned by hsp90, hsp70 and CRT are re-presented by MHC I molecules 
of APCs. Gp96 was the first HSP for which the re-presentation phenomenon was 
experimentally shown (Suto and Srivastava 1995, supra). Hsp70-chaperoned peptides have 
been shown recently to be re-presented by APCs (Castellino et al, 2000, J.Exp Med. 

15 191(1 1):1957-1964). The ability of other HSPs, hsp90 and CRT to introduce chaperoned 
peptides into the endogenous presentation pathway was tested in the AH1 system with 
RAW264.7 cells as the APCs. RAW264.7 cells were pulsed with hsp90, hsp70, calreticulin, 
or gp96, as a positive control, by themselves, or chaperoning the AH1-19 peptide. 
Chaperoning of peptides by the HSPs was accomplished in vitro as previously described 

20 (Blachere et al 1997, J.Exp. Med. 186:1315-1322; Basu and Srivastava 1999, J. Exp. 
Med.l89:797-802). T cells specific for L d /AH-1 secreted IFN-y when the RAW264.7 cells 
were pulsed with complexes of hsp90, hsp70, CRT or gp96 with AH149, but not when the 
HSPs were not complexed with the peptide (FIG. 8). Pulsing of RAW264.7 cells with AH1- 
19 alone did not lead to surface loading of L d molecules and consequent stimulation of T 

25 cells. Further, RAW264.7 cells pulsed with complexes of serum albumin with AH1-19, also 
failed to stimulate L d /AHl-specific T cells, thus indicating the specific requirement of HSPs 
for introducing the chaperoned peptides into the endogenous presentation pathway (FIG. 8). 

Gp96, hsp90 f hsp70 and CRT engage a common receptor. Does each HSP have a 
unique receptor or do they share a common receptor? This question was addressed by three 

30 independent criteria : by measuring re-presentation of gp96-chaperoned AH1-19 (as in FIGS, 
7 and 8) in the presence of excess and titrated quantities of free (i.e. not complexed to AH1- 
19) gp96, hsp90, hsp70 or serum albumin, by testing if macroglobulin, a known Ugand for 
CD91, a receptor for gp96, can inhibit re-presentation of peptides chaperoned by gp96, 
hsp90, hsp70 or CRT, and finally, if anti-CD91 antibody can inhibit re-presentation of 

35 peptides chaperoned by some or all the HSPs. 
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The gp96-AHl-19 complex was added to RAW264.7 cultures at a fixed final 
concentration of 40 ng/ml, while the competing HSPs or serum albumin were added at 
concentrations between (200-800) jig/ml. It was observed (FIG. 9A) that all 3 competing 
HSPs could inhibit re-presentation of gp96-chaperoned AH1-19, albeit with different 

5 efficiencies. Gp96 was able to compete with itself, while hsp90 was an even better 

competitor than gp96. Hsp70 was a less efficient competitor than gp96 but was a significant 
competitor. Albumin competed inefficiently. In quantitative terms, approximately 2 fold 
molar excess of hsp90, 6 fold molar excess of gp96, and a 13 fold molar excess of hsp70 
were required to inhibit by 50% the re-presentation of gp96-chaperoned peptides at a gp96 

1 0 concentration of 40 p,g/ml. All three HSPs were able to inhibit the re-presentation of gp96- 
chaperoned peptides completely at the highest concentration tested. This observation 
suggests that gp96, hsp90 and hsp70 utilize a single receptor albeit with differing 
specificities. 

In additional experiments, increasing quantities of oc 2 macroglobulin were added to 

15 re-presentation assays where AH1-19 chaperoned by gp96, hsp90, hsp70 or CRT was re- 

presented by RAW264.7 cells, to L d /AH-1 specific T cells, macroglobulin was observed 

to inhibit, in a titratable manner, re-presentation of peptides chaperoned by each of the four 

HSPs (FIG. 9B). Re-presentaion of peptides Chaperoned by gp96, hsp70 and CRT was 

inhibited equally, while re-presentation of hsp90-chaperoned peptide was inhibited the most 

20 effectively, and almost completely at the highest concentration of ct 2 macroglobulin tested. 

Serum albumin, when tested at the highest concentration, inhibited re-presentation only 

modestly. It may be noted that while the data in Fig. 9A show that the specific peptide- 

deficient HSPs inhibited re-presentation of gp96-AHl-19 complexes completely at the 

highest concentrations tested, ct 2 macroglobulin appears far less effective, in quantitative 
25 . . . 

terms, at inhibiting the re-presentation of peptides chaperoned by 3 of the 4 HSPs (FIG. 9B). 
However, this quantitative disparity disappears if one notes that the a 2 macroglobulin 
molecule is approximately 10 times larger in molecular mass than the average HSP molecule. 

A mouse monoclonal anti-CD91 IgGj antibody and isotype control antibodies were 
tested for their ability to inhibit re-presentation of peptides chaperoned by gp96, hsp90, 

30 

hsp70 and CRT. As before, the RAW264.7/AH1 system was utilized and the antibodies were 
added to re-presentation cultures at the concentrations indicated (Fig. 9C). Anti-CD91 
antibody was observed to inhibit, titratably and specifically, the re-presentation of AH1 
chaperoned by each of the 4 HSPs tested. The isotype control antibody did not inhibit re- 
presentation in any instance. Further, the inhibition mediated by the anti-CD91 antibody was 

35 

complete and uniform for each of the HSPs, indicating that CD91 is the sole receptor for 
each of the 4 HSPs. 
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Requirement of a Junctional proteasome complex for the representation ofgp96- 
chaperoned peptides by APCs. The re-presentation assay was carried out in presence or 
absence of the specific proteasome inhibitor, lactacystin. The peritoneal macrophages were 
treated or untreated with lactacystin for 2 hr and then cultured with gp96-VSV19 complex 

5 for another 2 hr in presence or absence of the inhibitor. The cells were chromium labeled at 
the same time for 1 hr and then washed and used as targets against CD8 + T cells specific for 
VSV8 in a 4 hr chromium release assay. Gp96-VSV19, lactacystin-untreated pulsed APCs 
were lyzed by VSV8-specific CD8 + T cells (FIG. 10A). As observed previously for gp96 
(Suto and Srivastava 1995, supra) and for hsp70 (Castellino et al, 2000, supra), only a small 

JO proportion of pulsed APCs were lyzed by the APCs even at the highest E:T ratio tested (FIG. 
10A). The APCs pulsed with VSV8 (through surface charging) were lyzed in a titratable and 
more significant degree, indicating that the APCs were entirely capable of being lyzed. The 
basis of the selective lyzability of APCs re-presenting HSP-chaperoned peptides is still 
unclear. However, and regardless of this observation, the lactacystin-treated, gp96-VSV19 

15 pulsed APCs were not recognized by the VSV8-specific CD8 + T cells and were not lyzed by 
them (FIG. 10A). Inhibition of proteasomal function thus inhibits the processing of peptides 
chaperoned by gp96 (FIG. 1 OA). As other HSPs tested also use the same portal of entry (FIG. 
9), it is assumed that inhibition of proteasome function interferes with processing of peptides 
chaperoned by them as well. The data recently reported by Castellino et al for hsp70 are 

2o consistent with this inference. 

Re-presentation of gp96~chaperoned peptides by MHC I of the APCs requires a 
Junctional TAP. The requirement of TAP in re-presentation of gp96 chaperoned peptides by 
APCs was tested. In a re-presentation assay in vitro, gp96 purified from liver or the same 
gp96 complexed with VSV19 was pulsed on to primary cultures of peritoneal macrophages 

25 derived from TAP +/+ or -/- mice. The pulsed APCs were used to stimulate CD8 + T cell lines 
specific for K b /VSV8. After incubation for 20 hr, the culture supernatants were tested for 
release of IFN-y as a marker for T cell stimulation (FIG. 1 0B). It was observed that APCs 
from TAP+/+ mice stimulated the CD8 + T cells specifically when cultured in presence of 
gp96 complexed to VSV19 but APCs from TAP1-/- mice were unable to do so. This result 

30 indicates that gp96-chaperoned peptides must enter the endoplasmic reticulum through the 
TAP molecules, for being loaded on the MHC I molecules. As other HSPs tested also use the 
same portal of entry (FIG. 9), it is assumed that peptides chaperoned by other HSPs also 
require TAP for re-presentation. Part of the data recently reported by Castellino et al for 
hsp70 are consistent with this inference. 

35 In studies in vivo, TAP1(-/-) (C57BL/6/SV129J) or wild type (C57BL/6) mice were 

immunized with the gp96-VSV19 complexes (50 jig of gp96 complexed with 50 |xg of 
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VSV19), or VSV19 alone, or gp96 alone. Spleen cells of immunized mice were cultured with 
the VSV8 and tested for cytotoxic activity on 51 Cr labeled EL4 cells or EL4 cells pulsed with 
the VSV8 peptide as targets. Spleen cells of wild type (C57BL/6) mice immunized with 
gp96-peptide complex showed VSV8-specific CTL activity whereas splenocytes of TAP 1 (- 

5 /-) mice immunized with gp96-peptide complex showed no cytotoxic activity (FIG. 10C). It 
may be argued that the lack of CTL activity in TAP-/- mice is a result of the poor loading 
and stability of MHC I molecules in general, rather than because of a specific block in re- 
presentation. While this is possible, and is difficult to entirely refute, we are easily able to 
generate VSV8-specific CTLs in TAP-A mice as in TAP+/+ mice by immunization with 

10 VSV8 peptide in incomplete Freund's adjuvant (data not shown). Sandberg et al. (1996) have 
reported similar data. In any case, the data from re-presentation assays in vitro using APCs 
from TAP+/+ and -/- mice (FIG. 10B) demonstrate the TAP requirement for re-presentation 
convincingly and without the complexity introduced by the experiment in vivo (FIG. 10C). 

15 

6.4 DISCUSSION 

The a2M receptor, which is also designated CD91, was initially identified as a 
protein related to the low density lipoprotein (LDL) receptor Related Protein (LRP) 
(Strickland et al, 1990, J. Biol. Chem.265: 17401 -17404; Kristensen et al, 1990, FEBS Lett. 

20 276:151-155). The protein consists of an ~420 kDa a subunit, an 85 kDa p subunit and a 39 
kDa tightly associated molecule (RAP). The a and P subunits are encoded by a single 
transcript of ~15 Kb in size (Van Leuven et al, 1993, Biochim. Biophys. Acta. 1173:71-74. 
The receptor has been shown to be present in cells of the monocytic lineage and in 
hepatocytes, fibroblasts and keratinocytes. CD91 has been shown previously to bind the 

25 activated form of the plasma glycoprotein a2M, which binds to and inhibits a wide variety of 
endoproteinases. <x2M receptor also binds to other ligands such as transforming growth 
factor P (O Connor-McCourt et al 9 1987, J. Biol. Chem. 262:14090-14099), platelet-derived 
growth factor (Huang et al, 1984, Proc. Natl. Acad. Sci. U.S.A. 81:342-346), and fibroblast 
growth factor (Dennis et al, 1989, J. Biol. Chem. 264:7210-7216). <x2M is thus believed to 

30 regulate, and specifically diminish, the activities of its various ligands. Complexed with 
these various ligands, <x2M binds a2M receptor on the cell surface and is internalized through 
receptor-mediated endocytosis. Uptake of a2M-complexed ligands has been assumed thus 
far to be the primary function of the a2M receptor, although a role for it in lipid metabolism 
is also assumed. a2M receptor ligands other than a2M, such as tissue-specific plasminogen 

35 activator-inhibitor complex (Orth et al, 1992, Proc. Natl. Acad. Sci. U.S.A. 89:7422-7426) 
and urokinase-PAIl complex (Nykjaer et al, 1992, J. Biol. Chem. 267:14543-14546), have 
been identified. These ligands attest to a role for a2M receptor in clearing a range of 



-84- 



WO 01/92474 



PCT/US01/18041 



extracellular, plasma products. 

The studies reported here show that the heat shock proteins gp96, hsp90, hsp70, and j 
calreticulin are additional ligands for the a2M receptor. The human gp96-coding gene has 
been mapped previously by us at chromosome 12 (q24.2-*q24.3) (Maki et al 9 1993, Somatic j 

5 Cell Mol. Gen. 19:73-81). It is of interest in this regard that the <x2M receptor gene has been 
mapped to the same chromosome and at a not too distant location (ql3-+ql4) (Hilliker et al 
Genomics 13:472-474). Gp96 binds <x2M receptor directly and not through other ligands 
such as a2M. Homogenous preparations of gp96, in solution, or cross-linked to a solid 
matrix, bind to the a2M receptor. Indeed, the major ligand for the <x2M receptor, <x2M, 

10 actually inhibits interaction of gp96 with a2M receptor, instead of promoting it, providing 
evidence that gp96 is a direct ligand for the <x2M receptor. The 80 kDa protein shown to 
bind gp96 is clearly an amino terminal degradation product of the a subunit of the a2M 
receptor. Degradation products of the a2M receptor in this size range have also been j 
observed in previous studies (Jensen et al, 1989, Biochem. Arch. 5:171-176), and may 

1 5 indicate the existence of a discrete ectodomain in the <x2M receptor which may be ] 

i 

particularly sensitive to proteolytic cleavage. 

The studies shown here also indicate that the a2M receptor is also engaged by hsp90, 
hsp70 and calreticulin. This observation is surprising in light of the fact that hsp70, , m 
calreticulin and hsp90/gp96 have no obvious structural similarities with each other. In 

20 another context, HSPs have presented us with this dilemma before: many of the various 

. V 

HSPs have no obvious homologies with each other and yet they appear to bind many of the j 
samepeptides (Ishii etal, 1999, J. Immunol. 162(3): 1303-1 309; Breloer etal, 1998, Eur. J. ^ 
Immunol. 28(3):1016-1021). It remains to be seen if gipl70, which belongs to the extended 
hsp70 family and hspl 10, which has no homology to any of the other HSPs, shall share the 

25 CD91 receptor. The multiple common properties of the HSPs which share the Fourth 
Paradigm (SrivastavaP.K., 1994, Experientia 50(1 1-12): 1054-1060), i.e. peptide-binding, 
interacting with APCs through a common receptor, ATP-binding and ATPase activity, % 
strongly suggest that these molecules must share conformational similarities which are not 
obvious from their primary sequence. Crystallographic analyses of the HSPs have begun to 

30 reveal their structure (Zhu et al, 1996, Science 272: 1602-1614; Prodromou et al, 1997, Cell 
90:65-75; Stebbins et al, 1997, Cell 89:239-250), and shall shed light on this question. 

The observations that <x2 macroglobulin and anti-CD91 antibodies inhibit 
re-presentation by each of the four HSPs completely, indicate that CD91 is the only receptor 
for the 4 HSPs. Considering the increasingly obvious role which the HSPs play in innate 

35 (Basu et al, 2000, Int. Immunol. 12(1 1):1539-1546) and adaptive immune response, this 
observation is somewhat counter-intuitive. However, the data on complete inhibition by two 
independent means (FIG.. 9) are quite compelling. We have noticed earlier, and we report 
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here, significant differences between hsp70 and hsp90/gp96 in their ability to compete for 
binding to gp96 receptors (Binder et al, 2000, J. Immunol. 165:2582-2587). Another group 
has also observed similar differences between gp96 and hsp70 (Amold-Schild et al, 1999, 
162:3757-3760). These differences are not inconsistent with our present report pointing to a 

5 single receptor for the 4 HSPs. They simply suggest that the various HSPs interact with a 
single receptor with widely differing affinities. Castellino et al have recently demonstrated 
re-presentation ofhsp70-chaperoned peptides by APCs through receptor-mediated uptake 
and have suggested the existence of receptors of different affinity classes for single HSPs. 
This argument is biologically cogent, but is not supported by our present data. 

10 Once the HSP-peptide complex binds to the receptor, peptides chaperoned by the 

HSPs must enter the APC along with the HSP. The studies reported here address the 
downstream events solely with respect to gp96 in the assumption that if all HSPs enter 
through the same portal, the downstream events must be identical or similar for peptides 
chaperoned by each of them. Our observations suggest that the peptides go from the 

15 endosome to the cytosol, to the ER and then to the secretory pathway to be re-presented on 
the surface. The transit through the cytosol is established through the proteasome 
requirement as well as through the TAP requirement of re-presentation. There is no known 
mechanism for transit of molecules from vesicular to soluble compartment although 
precedents certainly exist (Chiang et al, 1989, Science 246:382-385). Exploration of this 

20 pathway shall, without doubt, open a new window into our understanding of intracellular 
traffic of proteins. Castellino et al. have reported recently on the events as they occur 
downstream of receptor-mediated uptake of hsp70-peptide complexes by APCs (Castellino et 
al, 2000, supra). Our observations with a different HSP (gp96) are entirely consistent with 
that version of events and buttress the notion that the same portal of entry is used by all the 

25 peptide-chaperoning HSPs for re-presentation. 

As shown here, the heat shock protein-a2M receptor interaction provides a new type 
of function for <x2M receptor, a function of a sensor, not only of the extracellular 
environment with its previously known plasma-based ligands, but also a sensor of the 
intracellular milieu as well. HSPs such as gp96 are obligate intracellular molecules and are 

30 released into the extracellular milieu only under conditions of necrotic (but not apoptotic) 
cell death. Thus, the a2M receptor may act as a sensor for necrotic cell death (see FIG. 1 1), 
just as the scavenger receptor CD36 and the recently identified phosphatidyl serine-binding 
protein act as sensors of apoptotic cell death and receptors for apoptotic cells (Savill et al, 
1992, J. Clin. Invest.90:1513-1522; Fadok et al, 2000, Nature 405:85-90). Interaction of the 

35 macrophages with the apoptotic cells leads to a down-regulation of the inflammatory 

cytokines such as TNF (Fadok et al, 2000, supra), while gp96-APC interaction leads to re- 
presentation of gp96-chaperdned peptides by MHC I molecules of the APQ followed by 
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stimulation of antigen-specific T cells (Suto and Srivastava, 1995, supra) and, in addition, 
secretion of pro-inflammatory cytokines such as TNF, GM-CSF and IL-12, Interestingly, 
a2M, an independent ligand for the a2M receptor, inhibits representation of gp96- 
chaperoned peptides by macrophages. This observation suggests that re-presentation of 

5 gp96-chaperoned peptides can not occur physiologically in blood, but only within tissues as 
a result of localized necrotic cell death. This is consistent with the complete absence of gp96 
or other HSPs in blood under all conditions tested. Together, these observations point 
towards a possible mechanism whereby the release of HSPs in the blood as a result of severe 
tissue injury and lysis will not lead to a $ystemic and lethal pro-inflammatory cytokine 

lO cascade. 

It is possible, therefore, that the a2M receptor renders it possible for the APCs to 
sample (i) the extracellular milieu of the blood through a2M and other plasma ligands and 
(ii) the intracellular milieu of the tissues through HSPs, particularly of the gp96 family. The 
former permits APCs to implement their primordial phagocytic function, while the latter 
15 allows them to execute its innate and adaptive immunological functions. Viewed in another 
perspective, recognition of apoptotic cells by APCs through CD36 or phophatidyl serine, 
leads to anti-inflammatory signals, while interaction of the APC with necrotic cells through 
<x2M receptor leads to pro-inflammatory innate arid adaptive immune responses (see 
Srivastava et al t 1998, Immunity 8: 657-665). 

20 

The invention is not to be limited in scope by the specific embodiments described 
which are intended as single illustrations of individual aspects of the invention, and 
functionally equivalent methods and components are within the scope of the invention. 
25 Indeed various modifications of the invention, in addition to those shown and described 
herein will become apparent to those skilled in the art from the foregoing description and 
accompanying drawings. Such modifications are intended to fall within the scope of the 
appended claims. 

All references cited herein, including patent applications, patents, and other 
30 publications, are incorporated by reference herein in their entireties for all purposes. 



35 
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WHAT IS CLAIMED IS : 

1 . A method for identifying a compound that modulates an HSP-a2M 
receptor-mediated process, comprising: 

5 (a) contacting a test compound with a heat shock protein and an alpha (2) 

microglobulin receptor; and 

(b) measuring the level of alpha (2) macroglobulin receptor activity or 
expression, 

such that if the level of activity or expression measured in (b) differs from the level of alpha 
10 (2) macroglobulin receptor activity in the absence of the test compound, then a compound 
that modulates an HSP-a2M receptor-mediated process is identified 

2. The method of Claim 1 , in which the compound identified is an antagonist 
which interferes with the interaction of the heat shock protein with the alpha (2) 

15 macroglobulin receptor, further comprising the step of: 

(c) determining whether the level interferes with the interaction of the heat shock 
protein and the alpha (2) macroglobulin receptor. 

3 . The method of Claim 1 , in which the test compound is an antibody specific 
20 for the alpha (2) macroglobulin receptor. 

4. The method of Claim 1, in which the test compound is an antibody is specific 
for alpha (2) macroglobulin. 

25 5 . The method of Claim 1 , in which the test compound is an antibody is specific 

for a heat shock protein. 

6. The method of Claim 1, in which the test compound is a small molecule. 

30 7. The method of Claim 1, in which the test compound is a peptide. 

8. The method of Claim 7, in which the peptide comprises at least 5 consecutive 
amino acids of the alpha (2) macroglobulin receptor (SEQ ID NO.: 7). 

35 9. The method of Claim 7, in which the peptide comprises at least 5 consecutive 

amino acids of alpha (2) macroglobulin (SEQ ID NO.: 4). 
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10. The method of Claim 7, in which the peptide comprises at least 5 consecutive 
amino acids of a heat shock protein sequence. 

1 1 . The method of Claim 1 , in which the compound is an agonist which enhances 
5 the interaction of the heat shock protein with the alpha (2) macroglobulin receptor. 

12. The method of Claim 1 in which the HSP-a2M receptor-mediated process 
affects an autoimmune disorder, a disease or disorder involving disruption of antigen 
presentation or endocytosis, a disease or disorder involving cytokine clearance or 

10 inflammation, a proliferative disorder, a viral disorder or other infectious disease, 
hypercholesterolemia, Alzheimer's disease, diabetes, or osteoporosis. 

13. A method for identifying a compound that modulates an HSP-a2M 
receptor-mediated process, comprising: 

(a) contacting a test compound with a heat shock protein and an alpha (2) 
macroglobulin receptor-expressing cell; and 

(b) measuring the level of alpha (2) macroglobulin receptor activity or expression 
in the cell, 

such that if the level of activity of expression measured in (b) differs from the level of alpha 
(2) macroglobulin receptor activity in the absence of the test compound, then a compound 
that modulates an HSP-a2M receptor-mediated process is identified. 

14. The method of Claim 1 or 13 wherein the alpha (2) macroglobulin receptor 
activity measured is the ability to interact with a heat shock protein. 

25 

1 5 . The method of Claim 1 3 wherein the heat shock protein is non-covalently 
associated with an antigenic peptide and the alpha (2) macroglobulin receptor activity 
measured is the ability to re-present the antigenic peptide. 

30 16. The method of Claim 13 wherein the heat shock protein is non-covalently 

associated with an antigenic peptide and the alpha (2) macroglobulin receptor activity 
measured is the ability to stimulate a cytotoxic T cell response against the antigenic peptide. 

1 7. A method for identifying a compound that modulates the binding of a heat 
35 shock protein to the a2M receptor, comprising: 

(a) contacting a heat shock protein with an alpha (2) macroglobulin receptor, or 
fragment, or analog, derivative or mimetic thereof, in the presence of a test 



15 



20 
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compound; and 

(b) measuring the amount of heat shock protein bound to the alpha (2) 

macroglobulin receptor, or fragment, analog, derivative or mimetic thereof, 
such that if the amount of bound heat shock protein measured in (b) differs from the amount 
of bound heat shock protein measured in the absence of the test compound, then a compound 
that modulates the binding of an HSP to the <x2M receptor is identified. 

1 8. The method of Claim 65 wherein the solid surface is a microtiter dish. 



IQ 19. The method of Claim 1 7 wherein the amount of bound heat shock protein is 

measured by contacting the cell with a heat shock protein-specific antibody. 

20. The method of Claim 17 wherein the heat shock protein is labeled and the 
amount of bound heat shock protein is measured by detecting the label. 

15 

21 . The method of Claim 20 wherein the heat shock protein is labeled with a 
fluorescent label. 



22. A method for identifying a compound that modulates heat shock protein- 
20 mediated antigen presentation by alpha (2) macroglobulin receptor-expressing cells 
comprising: 

(a) adding a test compound to a mixture of alpha (2) macroglobulin receptor- 
expressing cells and a complex consisting essentially of a heat shock protein 
noncovalently associated with an antigenic molecule, under conditions 

25 conducive to alpha (2) macroglobulin receptor-mediated endocytosis; 

(b) measuring the level of stimulation of antigen-specific cytotoxic T cells by the 
alpha (2) macroglobulin receptor-expressing cells, 

such that if the level measured in (b) differs from the level of said stimulation in the absence 
of the test compound, then a compound that modulates heat shock protein-mediated antigen 
30 presentation by alpha (2) macroglobulin receptor-expressing cells is identified. 



23. A method for detecting a heat shock protein-alpha (2) macroglobulin receptor- 
related disorder in a mammal comprising measuring the level of activity from an HSP-alpha 
(2) macroglobulin receptor-mediated process in a patient sample, such that if the measured 
35 level differs from the level found in clinically normal individuals, then a heat shock protein- 
alpha (2) macroglobulin receptor-related disorder is detected. 
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24. The method of Claim 23 comprising contacting a sample derived from a 
patient with an antibody specific for the alpha (2) macroglobulin receptor under conditions 
such that immunospecific binding by the antibody. 

5 25. The method of Claim 23 comprising contacting a sample derived from a 

patient with an antibody specific for a heat shock protein under conditions such that 
immunospecific binding by the antibody. 

26. The method of Claim 23 comprising contacting a sample derived from a 
IQ patient with an antibody specific for an HSP-a2M complex under conditions such that 

immunospecific binding by the antibody. 

27. A method for modulating an immune response comprising administering to a 
mammal a purified compound that modulates the interaction of a heat shock protein with the 

15 alpha (2) macroglobulin receptor. 

28. The method of Claim 27, in which the compound is an agonist which 
enhances the interaction of the heat shock protein and the alpha (2) macroglobulin receptor. 

20 29. A method for treating an autoimmune disorder comprising administering to a 

mammal in need of such treatment a purified compound that interferes with the interaction of 
a heat shock protein with the alpha (2) macroglobulin f eceptor. 

30. The method of Claim 29 in which the compound is an antagonist that 
25 interferes with the interaction between the heat shock protein and the a2M receptor. 

3 1 . The method of Claim 30, in which the antagonist is an antibody specific for 
alpha (2) macroglobulin receptor. 

30 32. The method of Claim 30, in which the antagonist is an antibody specific for a 

heat shock protein. 

33. The method of Claim 30, in which the antagonist is a small molecule. 
35 34. The method of Claim 30, in which the antagonist is a peptide. 

35. The method of Claim 30, in which the peptide comprises at least 5 
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consecutive amino acids of alpha (2) macroglobulin receptor (SEQ JD NO.:l). 

36. The method of Claim 30, in which the peptide comprises at least 5 
consecutive amino acids of alpha (2) macroglobulin (SEQ ID NO.: 3). 

5 

37. The method of Claim 30, in which the peptide comprises at least 5 
consecutive amino acids of a heat shock protein sequence. 



38. A method for treating an autoimmune disorder comprising administering to a 
JO mammal in need of such treatment a recombinant cell that expresses an alpha (2) 

macroglobulin receptor which decreases the uptake of a heat shock protein by a functional 
alpha (2) macroglobulin receptor. 

39. A method for increasing the immunopotency of a cancer cell or an infected 

j 5 cell comprising transforming said cell with a nucleic acid comprising a nucleotide sequence 
that (i) is operably linked to a promoter, and (ii) encodes an alpha (2) macroglobulin 
receptor polypeptide. 



40. A method for increasing the immunopotency of a cancer cell or an infected 
20 cell comprising: 

(a) transforming said cell with a nucleic acid comprising a nucleotide sequence 
that (i) is operably linked to a promoter, and (ii) encodes an alpha (2) 
macroglobulin receptor polypeptide, and 

(b) administering said cell to an individual in need of treatment, 
25 so as to obtain an elevated immune response. 

41 . A recombinant cancer cell transformed with a nucleic acid comprising a 
nucleotide sequence that (i) is operably linked to a promoter, and (ii) encodes an alpha (2) 
macroglobulin receptor polypeptide. 

30 

42. A recombinant infected cell transformed with a nucleic acid comprising a 
nucleotide sequence that (i) is operably linked to a promoter, and (ii) encodes an alpha (2) 
macroglobulin receptor polypeptide. 



43. The recombinant cell of Claim 41 or 42 which is a human cell. 

44. A kit, comprising in one or more containers: (a) an anti-a2M receptor 
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antibody or a nucleic acid probe capable of hybridizing to an a2M receptor nucleic acid, (b) a 
purified heat shock protein, nucleic acid encoding a heat shock protein, or cell expressing a 
heat shock protein; and (c) instructions for use in detecting a heat shock protein-alpha (2) 
macroglobulin receptor-related disorder. 

5 

45. The kit of Claim 44 wherein the antibody or nucleic acid probe is labeled with 
a detectable marker. 

46. The kit of Claim 44 further comprising a labeled macroglobulin receptor 
10 polypeptide. 

47. A kit, in one or more containers, comprising: (a) a purified heat shock 
protein, nucleic acid encoding a heat shock protein, or cell expressing a heat shock protein; 
and (b) an alpha (2) macroglobulin receptor polypeptide, nucleic acid encoding an alpha (2) 

2 5 macroglobulin receptor polypeptide, or cell expressing an alpha (2) macroglobulin receptor 
polypeptide. 

48. The kit of Claim 47 in which the alpha (2) macroglobulin receptor 
polypeptide, nucleic acid encoding an alpha (2) macroglobulin receptor polypeptide, or cell 

20 expressing an alpha (2) macroglobulin receptor polypeptide is purified. 

49. The kit of Claim 47 further comprising instructions for use in treating an 
autoimmune disorder, an infectious disease, or a proliferative disorder. 

25 50. A method for identifying an a2M receptor fragment capable of binding a heat 

shock protein, said method comprising: 

(a) contacting a heat shock protein, or peptide-binding fragment thereof, with one 
or more alpha (2) macroglobulin receptor fragments; and 

(b) identifying an a2M receptor fragment which specifically binds to the heat 
30 shock protein, or peptide-binding fragment thereof. 

51 . A method for identifying an a2M receptor fragment capable of inducing an 
HSP-a2M receptor-mediated process, said method comprising: 

(a) contacting a heat shock protein with a cell expressing <x2M receptor fragment; 

35 md 

(b) measuring the level of alpha (2) macroglobulin receptor activity in the cell, 

such that if the level of the HSP-a2M receptor-mediated process or activity measured in (b) 
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is greater than the level of alpha (2) macroglobulin receptor activity in the absence of the 
a2M receptor fragment, then an a2M receptor fragment capable of inducing an HSP-a2M 
receptor-mediated process is identified. 

5 52. The method of Claim 5 1 wherein the alpha (2) macroglobulin receptor 

activity measured is the ability to interact with the heat shock protein. 

53. The method of Claim 5 1 wherein the heat shock protein is non-covalently 
associated with an antigenic peptide and the alpha (2) macroglobulin receptor activity 

10 measured is the ability to re-present the antigenic peptide. 

54. The method of Claim 5 1 wherein the heat shock protein is non-covalently 
associated with an antigenic peptide and the alpha (2) macroglobulin receptor activity 
measured is the ability to stimulate a cytotoxic T cell response against the antigenic peptide. 

15 

55. A method for identifying a heat shock protein fragment capable of binding an 
<x2M receptor, said method comprising: 

(a) contacting an <x2M receptor with one or more heat shock protein fragments; 
and 

20 (b) identifying a heat shock protein fragment which specifically binds to the <x2M 

receptor. 

56. A method for identifying a heat shock protein fragment capable of inducing 
an HSP-a2M receptor-mediated process, said method comprising: 

25 (a) contacting an a2M receptor fragment with a cell expressing a heat shock 

protein; and 

(b) measuring the level of alpha (2) macroglobulin receptor activity in the cell, 
such that if the level of the HSP-a2M receptor-mediated process or activity measured in (b) 
is greater than the level of alpha (2) macroglobulin receptor activity in the absence of said 

30 heat shock protein fragment, then a heat shock protein fragment capable of inducing an HSP- 
ct2M receptor-mediated process is identified. 

57. The method of Claim 56 wherein the alpha (2) macroglobulin receptor 
activity measured is the ability to interact with the heat shock protein fragment. 

35 

58. The method of Claim 56 wherein the heat shock protein fragment is non- 
covalently associated with an antigenic peptide and the alpha (2) macroglobulin receptor 
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activity measured is the ability to re-present the antigenic peptide. 

59. The method of Claim 56 wherein the heat shock protein fragment is non- 
covalently associated with an antigenic peptide and the alpha (2) macroglobulin receptor 

5 activity measured is the ability to stimulate a cytotoxic T cell response against the antigenic 
peptide. 

60. A method for identifying a molecule that binds specifically to an a2M 
receptor, said method comprising: 

j q (a) contacting an a2M receptor with one or more test molecules under conditions 

conducive to binding; and 
(b) identifying one or more test molecules that specifically bind to the <x2M 
receptor. 

15 61. The method of Claim 60 wherein said test molecules are potential 

immunotherapeutic drugs. 

62. A method for screening for molecules that specifically bind to an a2M 
receptor comprising: 

20 (a) contacting an <x2M receptor with one or more test molecules under conditions 

conducive to binding; and 
(b) determining whether any of said test molecules specifically bind to the a2M 
receptor. 

25 63. A method for identifying a compound that modulates the binding of an o2M 

receptor ligand to the a2M receptor comprising: 

(a) contacting an a2M receptor with an a2M receptor ligand, or an a2M receptor- 
binding fragment, analog, derivative or mimetic thereof, in the presence of 
one or more test compounds; and 
30 (b) measuring the amount of <x2M receptor ligand, or fragment, analog, derivative 

or mimetic thereof, bound to the a2M receptor, 
such that if the amount of bound a2M receptor ligand measured in (b) differs from the 
amount of bound a2M receptor measured in the absence of the test compound, then a 
compound that modulates the binding of an a2M receptor ligand to the a2M receptor is 
35 identified. 

64. The method of Claim 17 or 63, in which the alpha (2) macroglobulin receptor 
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contacted in step (a) is on a cell surface. 



65. The method of Claim 17 or 63, wherein the alpha (2) macroglobulin receptor 
is immobilized to a solid surface. 

5 

66. The method of Claim 1 , 64, or 22 in which the heat shock protein is gp96. 

67. The method of Claim 1 , 64, or 22 in which the heat shock protein is hsp90. 

10 68. The method of Claim 1 , 64, or 22 in which the heat shock protein is hsp70. 

69. The method of Claim 1 , 64, or 22 in which the heat shock protein is 
calreticuhn. 



1 5 70. A method for identifying a compound that modulates the interaction between 

the <x2M receptor and an <x2M receptor ligand, comprising: 

(a) contacting an a2M receptor with one or more test compounds; and 

(b) measuring the level of <x2M receptor activity or expression, 

such that if the level of activity or expression measured in (b) differs from the level of a2M 
20 receptor activity in the absence of one or moire test compounds, then a compound that 

modulates the interaction between the a2M receptor and an <x2M receptor ligand is identified. 



7 1 . The method of Claim 63 or 70 wherein the <x2M receptor ligand is <x2 
macroglobulin. 

25 

72. A method for identifying a compound that modulates antigen presentation by 
<x2M receptor-expressing cells comprising: 

(a) adding one or more test compounds to a mixture of a2M receptor-expressing 
cells and a complex comprising an a2M receptor ligand and an antigenic 

30 molecule, under conditions conducive to a2M receptor-mediated endocytosis; 

(b) measuring the level of stimulation of antigen-specific cytotoxic T cells by the 
a2M receptor-expressing cells, 

such that if the level measured in (b) differs from the level of said stimulation in the absence 
of the one or more test compounds, then a compound that modulates antigen presentation by 
35 a2M receptor-expressing cells is identified. 

73. The method of Claim 22 or 72, in which the measuring stimulation of antigen- 
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specific cytotoxic T cells by the a2M receptor-expressing cells of step (b) comprises: 

(i) adding the alpha (2) macroglobulin receptor-expressing cells formed 
in step (a) to T cells under conditions conducive to the activation of 
the T cells; and 

5 (ii) comparing the level of activation of said cytotoxic T cells with the 

level of activation of T cells by an alpha (2) macroglobulin receptor- 
expressing cell formed in the absence of the test compound, 
wherein an increase of decrease in level of T cell activation indicates that a compound that 
modulates heat shock protein-mediated antigen presentation by alpha (2) macroglobulin 
10 receptor-expressing cells is identified. 



74. A method for modulating an immune response comprising administering to a 
mammal a purified compound that binds to the a2M receptor, in an amount effective to 
modulate an immune response in the mammal. 

15 

75. A method for treating or preventing a disease or disorder comprising 
administering to a mammal a purified compound that binds to the a2M receptor, in an 
amount effective to treat or prevent the disease or disorder in the mammal. 

2o 76. The method of Claim 75 wherein the disease or disorder is cancer or an 

infectious disease. 

77. A method for treating an autoimmune disorder comprising administering to a 
mammal in need of such treatment a purified compound that binds to the a2M receptor, in an 

25 amount effective to treat an autoimmune disorder in the mammal. 

78. A method for stimulating an immune response in a patient comprising 
administering to said patient blood which hal been withdrawn from said patient and treated 
to remove an a2M receptor ligand. 

30 

79. The method of Claim 78 further comprising administering to said patient a 
heat shock protein or a heat shock protein-antigenic peptide complex. 

80. A method for stimulating an immune response in a patient comprising: 

35 (a) removing a <x2M receptor ligand from blood withdrawn from said patient; and 

(b) returning at least a portion of the a2M receptor ligand-depleted blood to said 

patient. 
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81. A method for stimulating an immune response in a patr ^ comprising: 

(a) withdrawing blood from said patient; 

(b) removing a a2M receptor ligand from said blood; and 

(c) returning at least a portion of the a2M receptor ligand-depleted blood to said 

5 patient. 

82. The method of Claim 8 1 further comprising after step (a) and before step (c) 
the step of adding a heat shock protein or a heat shock protein-antigenic peptide complex to 
said blood. 

10 

83. The method of Claims 80 or 8 1 wherein removing a a2M receptor ligand from 
the blood comprises the step of contacting the blood with a solid phase attached to a a2M 
receptor ligand-binding molecule for a time period and under conditions sufficient to allow 
binding of a2M receptor ligand to the <x2M receptor ligand-binding molecule solid phase. 

15 

84. The method of Claim 83 wherein the a2M receptor ligand-binding molecule is 
a2M receptor, or a fragment thereof. 

85. The method of Claim 83 wherein said <x2M receptor ligand-binding molecule 
20 does not bind a heat shock protein. 

86. The method of Claim 85 wherein the a2M receptor ligand-binding molecule is 
an a2M receptor ligand-specific antibody, or a fragment thereof. 

25 87. The method of Claims 80 or 8 1 wherein an apheresis system is used in said 

removing step. 

88. The method of Claim 81 wherein blood is withdrawn manually in said 
withdrawing step. 

30 

89. The method of Claim 80 or 8 1 wherein said removing step comprises 
separating the plasma from said blood and treating said plasma to remove said a2M receptor 
ligand. 

3 5 90. The method of Claim 78 wherein said blood is administered to said patient by 

syringe. 
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91. The method of Claim 78 wherein said blood is administered to said patient by 
an intravenous drip. 

92. Hie method of Claim 80 or 81 wherein said blood is returned to said patient 
5 by syringe. 

93. The method of Claim 80 or 8 1 wherein said blood is returned to said patient 
by an intravenous drip. 

10 94. A kit comprising in one or more containers a solid phase chromatography 

column with a purified a2M receptor ligand binding molecule attached thereto, such that 
withdrawn blood can be run over the column to deplete the blood of a a2M receptor ligand. 

95 . The kit of Claim 94 wherein the a2M receptor ligand binding molecule does 
2 5 not bind heat shock proteins. 

96. The method of Claim 78, 80, or 81 wherein the a2M receptor ligand is a2M, a 
lipoprotein complex, lactofeirin, tissue-type plasminogen activator, urokinase-type 
plasminogen activator, or an exotoxin. 

20 
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30 
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1. Specific binding of HSPs and a 2 -macroglobulin to primary cultures and 
cell lines of several histological origins* 
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GGcSSS S£S5X£ ^ GAGGC GG ^ GGGGG GAGCCCCTAG TGCTCCATCA 60 

?t?ttcSgc ACGCCCCcca ccccccaccc CGCCTCCTCQ 120 

ScSSSS JSJSS^ ™^ TCGGC TCCGAGA TGG GGCTGTGAGC TTCGCCCTGG 180 

^ GGGGGAGA GGAGCGAGGA GTAAAGCAGG GGTGAAGGGT TCGAATTTGG GGGCAGGGGG 240 

CGCACCCGCG TCAGCAGGCC CTTCCCAGGG GGCTCGGAAC TGTACCATTT CACCTATG^ 300 

CCTGGTTCGC TTTGCTTAAG GAAGGATAAG ATAGAAGAGT CGGGGAGAGG SSSgG 360 

SSSErS JSSJSS GGCGAGGACA AGAAGTAACA GGAcSg SS JS 

CTGTTTGCAT CGGCCCACAC C ATG CTG ACC CCG CCG TTG CTG CTG CTC GTG 471 

Met Leu Thr Pro Pro Leu Leu Leu Leu Val 
. 1 5 io 

CCG CTG CTT TCA GCT CTG GTC TCC GGG GCC ACT ATG GAT GCC CCT AAA 519 
Pro Leu Leu Ser Ala Leu Val Ser Gly Ala Thr Met Asp Ala Pro Lys ' 
15 20 25 

ACT TGC AGC CCT AAG CAG TTT GCC TGC AGA* GAC CAA -ATC ACC TGT ATC ' 567 
Thr Cys Ser Pro Lys Gin Phe Ala Cys Arg Asp Gin lie Thr Cys He 
30 35 40 

TCA AAG GGC TGG CGG TGT GAC GGT GAA AGA GAT TGC CCC GAC GGC TCT 615 
Ser Lys Gly Trp Arg Cys Asp Gly Glu Arg Asp Cys Pro Asp Gly Ser 
45 50 55 

GAT GAA GCC CCT GAG ATC TGT CCA CAG AGT AAA GCC CAG AGA TGC CCG 663 
Asp Glu Ala Pro Glu lie Cys Pro Gin Ser Lys Ala Gin Arg Cys Pro 
60 65 70 

CCA AAT GAG CAC AGT TGT CTG GGG ACT GAG CTA TGT GTC CCC ATG TCT 711 
Pro Asn Glu His Ser Cys Leu Gly Thr Glu Leu Cys Val Pro Met Ser 
75 80 85 90 

CGT CTC TGC AAC GGG ATC CAG GAC TGC ATG GAT GGC TCA GAC GAG GGT 759 
Arg Leu Cys Asn Gly He Gin Asp Cys Met Asp Gly Ser Asp Glu Gly 
95 100 105 

GCT CAC TGC CGA GAG CTC CGA GCC AAC TGT TCT CGA ATG GGT TGT CAA 807 
Ala His Cys Arg Glu Leu Arg Ala Asn Cys Ser Arg Met Gly Cys Gin 
HO 115 * 120 

CAC CAT TGT GTA CCT ACA CCC AGT GGG CCC ACG TGC TAC TGT AAC AGC 855 
His His Cys Val Pro Thr Pro Ser Gly Pro Thr Cys Tyr Cys Asn Ser- 
125 130 • 135 

AGC TTC CAG CTC GAG GCA GAT GGC AAG ACG TGC AAA GAT TTT GAC GAG 903 
Ser Phe Gin Leu Glu Ala Asp Gly Lys Thr Cys Lys Asp Phe Asp Glu 
140 145 150 

TGT TCC GTG TAT GGC ACC TGC AGC CAG CTT TGC ACC AAC ACA GAT GGC 951 
Cys Ser Val Tyr Gly Thr Cys Ser Gin Leu Cys Thr Asn Thr Asp Gly 
155 160 165 170 

TCC TTC ACA TGT GGC TGT GTT GAA GGC TAC CTG CTG CAA CCG GAC AAC 999 
Ser Phe Thr Cys Gly Cys Val Glu Gly Tyr Leu Leu Gin Pro Asp Asn 
175 180 - 185 

CGC TCC TGC AAG GCC AAG AAT GAG CCA GTA GAT CGG CCG CCA GTG CTA 10*47 
Arg Ser Cys Lys Ala Lys Asn Glu Pro Val Asp Arg Pro Pro Val Leu 
190 195 200 
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CTG ATT GCC AAC TCT CAG AAC ATC CTA GCT ACG TAC CTG AGT GGG GCC 1095 
Leu lie Ala Asn Ser Gin Asn lie Leu Ala Thr Tyr Leu Ser Gly Ala 
205 210 215 

CAA GTG TCT ACC ATC ACA CCC ACC AGC ACC CGA CAA ACC ACG GCC ATG 1143 
Gin Val Ser Thr lie Thr Pro Thr Ser Thr Arg Gin Thr Thr Ala Met 
220 225 230 

GAC TTC AGT TAT GCC AAT GAG ACC GTA TGC TGG GTG CAC GTT GGG GAC 1191' 
Asp Phe Ser Tyr Ala Asn Glu Thr Val Cys Trp Val His Val Gly Asp 
235 240 ' 245 250 

AGT GCT GCC CAG ACA CAG CTC AAG TGT GCC CGG ATG CCT GGC CTG AAG 1239 
Ser Ala Ala Gin Thr Gin Leu Lys Cys Ala Arg Met Pro Gly Leu Lys 
255 260 265 

GGC TTT GTG GAT GAG CAT ACC ATC AAC ATC TCC CTC AGC CTG CAC CAC 1287 
Gly Phe Val Asp Glu His Thr lie Asn lie Ser Leu Ser Leu His His 
270 . . s - 275 280 

GTG . GAG CAG ATG GCA ATC GAC TGG CTG ACG GGA AAC TTC TAC TTT GTC 1335 
Val Glu Gin Met Ala He Asp Trp Leu Thr Gly Asn Phe Tyr Phe Val 
285 290 ' 295 

GAC GAC ATT GAC GAC AGG ATC TTT GTC TGT AAC CGA AAC GGG GAC ACC 1383 
Asp Asp He Asp Asp Arg He Phe Val Cys Asn Arg Asn Gly Asp Thr 
300 305 310 

TGT GTC ACT CTG CTG GAC CTG GAA CTC TAC AAC CCC AAA GGC ATC GCC 1431 
Cys Val Thr Leu Leu Asp Leu Glu Leu Tyr Asn * Pro Lys Gly He Ala 
315 320 325 " 330 

TTG GAC CCC GCC ATG GGG AAG GTG TTC TTC ACT GAC TAC GGG CAG ATC 1479 
Leu Asp Pro Ala Met Gly Lys Val Phe Phe Thr Asp Tyr Gly Gin He 
335 340 - 345 

CCA AAG GTG GAG CGC TGT GAC ATG GAT GGA CAG AAC CGC ACC AAG CTG 1527 
Pro Lys Val Glu Arg Cys Asp Met Asp Gly Gin Asn Arg Thr Lys Leu 
350 355 360 

GTG GAT AGC AAG ATC GTG TTT CCA CAC GGC ATC ACC CTG GAC CTG GTC 1575 
Val Asp Ser Lys He Val Phe Pro His Gly lie Thr Leu Asp Leu Val " 
365 370 375 

AGC CGC CTC GTC TAC TGG GCG GAC GCC TAC CTA GAC TAC ATC GAG GTG 1623 
Ser Arg Leu Val Tyr Trp Ala Asp. Ala Tyr Leu Asp Tyr lie Glu Val 
380 385 390 

GTA GAC TAC GAA GGG AAG GGT CGG CAG ACC ATC ATC CAA GGC ATC CTG 1671 
Val Asp Tyr Glu Gly Lys Gly Arg Gin Thr He He Gin Gly He Leu 
395 400 405 410 

ATC GAG CAC CTG TAC GGC CTG ACC GTG TTT GAG AAC TAT CTC TAC GCC 1719 
He Glu His Leu Tyr Gly Leu Thr Val Phe Glu Asn Tyr Leu Tyr Ala 
415 420 425 

ACC AAC TCG- GAC AAT GCC AAC ACG CAG CAG AAG ACG AGC GTG ATC CGA 1767 
Thr Asn Ser Asp Asn Ala Asn Thr Gin Gin Lys Thr Ser Val lie Arg 
430 435 440 
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GTG AAC CGG TTC AGT ACT GAG TAG CAG GTC GTC ACC CGT GTG GAC 1815 

Val Asn Arg Phe Asn Ser Thr Glu Tyr Gin Val Val Thr Arg Val Asp 
445 450 4 55 

AAG GGT GGT GCC CTG CAT ATC TAC CAC CAG CGA CGC CAG CCC CGA GTG 1863 
Lys Gly Gly Ala Leu His lie Tyr His Gin Arg Arg Gin Pro Arg Val 
460 465 470 

CGG AGT CAC GCC TGT GAG AAT GAC CAG TAC GGG AAG CCA GGT GGC TGC 1911 
Arg Ser His Ala Cys Glu Asn Asp Gin Tyr Gly Lys Pro Gly Gly Cys 
475 480 * 485 " 490 

TCC GAC ATC TGC CTC CTG GCC AAC AGT CAC AAG GCA AGG ACC TGC AGG 1959 
Ser Asp lie Cys Leu Leu Ala Asn Ser His Lys Ala Arg Thr Cys Arg 
495 500 ** 505 

TGC AGG TCT GGC TTC AGC CTG GGA AGT GAT GGG AAG TCT TGT AAG AAA 2007 
Cys Arg Ser Gly Phe Ser Leu Gly Ser Asp Gly Lys Ser Cys Lys Lys 
510 515 520 

CCT GAA CAT GAG CTG TTC CTC GTG TAT GGC AAG GGC CGA CCA GGC ATC 2055 
Pro Glu His Glu Leu Phe Leu Val Tyr Gly Lys Gly Arg Pro Gly He 
525 530 * 535 

ATT AGA GGC ATG GAC ATG GGG GCC AAG GTC CCA GAT GAG CAC ATG ATC 2103 
He Arg Gly Met Asp Met Gly Ala Lys Val Pro Asp Glu His Met He 
540 545 550 

CCC ATC GAG AAC CTT ATG AAT CCA CGC GCT CTG GAC TTC CAC GCC GAG 2151 
Pro He Glu Asn Leu Met Asn Pro Arg Ala Leu Asp Phe His Ala Glu 
555 560 565 570 

ACC GGC TTC ATC TAC TTT GCT GAC ACC ACC AGC TAC CTC ATT GGC CGC 2199 
Thr Gly Phe He Tyr Phe Ala Asp Thr Thr Ser Tyr Leu He Gly Arg 
575 S80 585 

CAG AAA ATT GAT GGC ACG GAG AGA GAG ACT ATC CTG AAG GAT GGC ATC 2247 
Gin Lys He Asp Gly Thr Glu Arg Glu thr He Leu Lys Asp Gly He 
590 595 600 

CAC AAT GTG GAG GGC GTA GCC GTG GAC TGG ATG GGA GAC AAT CTT TAC 2295 
His Asn Val Glu Gly Val Ala Val Asp Trp Met Gly Asp Asn Leu Tyr ■ 
605 610 615 

TGG ACT GAT GAT GGC CCC AAG AAG ACC ATT AGT GTG GCC AGG CTG GAG 2343 
Trp Thr Asp Asp Gly Pro Lys Lys Thr He Ser Val Ala Arg Leu Glu 
620 ' 625 630 

AAA GCC GCT CAG ACC CGG AAG ACT CTA ATT GAG GGC AAG ATG ACA CAC 2391 
Lys Ala Ala Gin Thr Arg Lys Thr Leu He Glu Gly Lys Met Thr His 
635 640 645 650 

CCC AGG GCC ATT GTA GTG GAT CCA CTC AAT GGG TGG ATG TAC TGG ACA 2439 
Pro Arg Ala He Val Val Asp Pro Leu Asn Gly Trp Met Xyr Trp Thr 
655 660 665 

GAC TGG GAG • GAG GAC CCC AAG GAC AGT CGG CGA GGG CGG CTC GAG AGG " 24*87 
Asp Trp Glu Glu Asp Pro Lys Asp Ser Arg Arg Gly Arg Leu Glu- Arg 
670 675 680 
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GCT TGG ATG GAC GGC TCA CAC CGA GAT ATC TTT GTC ACC TCC AAG ACA 2535 
Ala Trp Met Asp Gly Ser His Arg Asp lie Phe Val Thr Ser Lys Thr 
685 690 695 

GTG CTT TGG CCC AAT GGG CTA AGC CTG GAT ATC CCA GCC GGA CGC CTC 2583 
Val Leu Trp Pro Asn Gly Leu Ser Leu Asp He Pro Ala Gly Arg Leu 
700 705 710 

TAC TGG GTG GAT GCC TTC TAT GAC CGA ATT GAG ACC ATA CTG CTC AAT 2631 
Tyr Trp Val Asp Ala Phe Tyr Asp Arg He Glu Thr He Leu Leu Asn 
715 720 725 730 

GGC ACA GAC CGG AAG ATT GTA TAT GAG GGT CCT GAA CTG AAT CAT GCC 2679 
Gly Thr Asp Arg Lys lie Val Tyr Glu Gly Pro Glu Leu Asn His Ala 
735 740 745 

TTC GGC CTG TGT CAC CAT GGC AAC TAC CTC TTT TGG ACC GAG' TAG CGG 2727 
Phe Gly Leu Cys His His Gly Asn Tyr Leu Phe Trp Thr Glu Tyr Arg 
750 755 760 

AGC GGC AGC GTC TAC CGC TTG GAA CGG GGC GTG GCA GGC GCA CCG CCC 2775 
Ser Gly Ser Val Tyr Arg Leu Glu Arg Gly Val Ala Gly Ala Pro Pro 
765 770 775 

ACT GTG ACC CTT CTG CGC AGC GAG AGA CCG CCT ATC TTT GAG ATC CGA 2823 
Thr Val Thr Leu Leu Arg Ser Glu Arg Pro Pro He Phe Glu lie Arg 
780 785 790 

ATG TAC GAC GCG CAC GAG CAG CAA GTG GGT ACC AAC AAA TGC CGG GTA 2871 
Met Tyr Asp Ala His Glu Gin Gin Val Gly Thr Asn Lys Cys Arg Val 
795 800 805 810 

AAT AAC GGA GGC TGC AGC AGC CTG TGC CTC GCC ACC CCC GGG AGC CGC. 2919 
Asn Asn Gly Gly Cys Ser Ser Leu Cys Leu Ala Thr Pro Gly Ser Arg 
815 820 825 

CAG TGT GCC TGT GCC GAG GAC CAG GTG TTG GAC ACA GAT GGT GTC ACC 2967 
Gin Cys Ala Cys Ala Glu Asp Gin Val Leu Asp Thr Asp Gly Val Thr 
830 835 840 

TGC TTG GCG AAC CCA TCC TAC GTG CCC CCA CCC CAG TGC CAG CCG GGC 3015 
Cys Leu Ala Asn Pro Ser Tyr Val Pro Pro Pro Gin Cys Gin Pro Gly * 
845 850 855 

CAG TTT GCC TGT GCC AAC AAC CGC TGC ATC CAG GAG CGC TGG AAG TGT 3063 
Gin Phe Ala Cys Ala Asn Asn Arg Cys He Gin Glu Arg Trp Lys Cys 
860 865 870 

GAC GGA GAC AAC GAC TGT CTG GAC AAC AGC GAT GAG GCC CCA GCA CTG 3111 
Asp Gly Asp Asn Asp Cys Leu Asp Asn Set Asp Glu Ala Pro Ala Leu 
875 880 885 B90 

TGC CAT CAA CAC ACC TGT CCC TCG GAC CGA TTC AAG TGT GAG AAC AAC 3J.59 
Cys His Gin His Thr Cys Pro Ser Asp Arg Phe Lys Cys Glu Asn Asn 
895 900 905 

CGG TGT ATC CCC AAC CGC TGG CTC TGT GAT GGG GAT AAT GAT TGT GGC 3207 
Arg Cys He Pro Asn Arg Trp Leu Cys Asp Gly Asp Asn Asp Cys Gly 
910 915 920 
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AAC AGC GAG GAC GAA TCC AAT GCC ACG TGC TCA GCC CGC ACC TGT CCA 3255 
Asn Ser Glu Asp GIu Ser Asn Ala Thr Cys Ser Aia Arg Thr Cys Pro 
925 930 935 

CCC AAC CAG TTC TCC TGT GCC AGT GGC CGA TGC ATT CCT ATC TCA TGG 3303 
Pro Asn Gin Phe Ser Cys Ala Ser Gly. Arg Cys lie Pro lie Ser Trp 
940 945 ' 950 

ACC TGT GAT CTG GAT GAT GAC TGT GGG GAC CGG TCC GAT GAG TCA GCC 3351 
Thr Cys Asp Leu Asp Asp Asp Cys Gly Asp Arg Ser Asp Glu Ser Ala 
955 960 965 970 

TCA TGC GCC TAC CCC ACC TGC TTC CCC CTG ACT CAA TTT ACC TGC AAC 3399 
Ser Cys Ala Tyr Pro Thr Cys Phe Pro Leu Thr Gin Phe Thr Cys Asn- 
975 980 985 

AAT GGC AG A TGT ATT AAC ATC AAC TGG CGG TGT GAC AAC GAC AAT GAC 3447 
Asn Gly Arg Cys lie Asn lie Asn Trp Arg Cys Asp Asn Asp Asn Asp 
990 995 1000 

TGT GGG GAC AAC AGC GAC GAA GCC GGC TGC AGT CAC TCC TGC TCC AGT 34 95 
Cys Gly Asp Asn Ser Asp Glu Ala Gly Cys Ser His Ser Cys Ser Ser 
1005 1010 1015 

ACC CAG TTC AAG TGC AAC AGT GGC AGA TGC ATC CCC GAG CAC TGG ACG 354 3 
Thr Gin Phe Lys Cys Asn Ser Gly Arg Cys lie Pro Glu His Trp Thr 
1020 1025 1030 

TGT GAT GGG GAC AAT GAT TGT GGG GAC TAC AGC GAC GAG ACA CAC GCC 3591 
Cys Asp Gly Asp Asn Asp Cys Gly Asp Tyr Ser Asp Glu Thr His Ala 
1035 " 1040. 1045 1050 

AAC TGT ACC AAC CAG GCT ACA AGA CCT CCT GGT GGC TGC CAC TCG GAT 3639 
Asn Cys Thr Asn Gin Ala Thr Arg Pro Pro Gly Gly Cys His Ser Asp 
1055 1060 1065 

GAG TTC CAG TGC CCG CTA GAT GGC CTG TGC ATC CCC CTG AGG TGG CGC 3687 
Glu Phe Gin Cys Pro Leu Asp Gly Leu Cys lie Pro Leu Arg Trp Arg 
1070 1075 1080 

TGC GAC GGG GAC ACC GAC TGC ATG GAT TCC AGC GAT GAG AAG AGC TGT 3735 
Cys Asp Gly Asp Thr Asp Cys Met Asp Ser Ser Asp Glu Lys Ser Cys 
1085 1090 1095 

GAG GGC GTG ACC CAT GTT TGT GAC CCG AAT GTC AAG TTT GGC TGC AAG 3783 
Glu Gly Val Thr His Val Cys Asp Pro Asn Val Lys Phe Gly. Cys Lys 
1100 1105 1110 

GAC TCC GCC CGG TGC ATC AGC AAG GCG TGG GTG TGT GAT GGC GAC AGC 3831 
Asp Ser Ala Arg Cys He Ser Lys Ala Trp Val Cys Asp Gly Asp Ser 
1115 1120 1125 H30 

GAC TGT GAA GAT AAC TCC GAC GAG GAG AAC TGT GAG GCC CTG GCC TGC 3879 
Asp Cys Glu Asp Asn Ser Asp Glu Glu Asn Cys Glu Ala Leu Ala Cys 
1135 1140 1X45 

AGG CCA CCC TCC CAT CCC TGC GCC AAC AAC ACC TCT GTC TGC CTG CCT 3927 
Arg Pro Pro Ser His Pro Cys Ala Asn Asn Thr Ser Val Cys Leu Pro 
1150 H55 1160 
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CCT GAC AAG CTG TGC GAC GGC AAG GAT GAC TGT GGA GAC GGC TCG GAT 3975 
Pro Asp Lys Leu Cys Asp Gly Lys Asp Asp Cys Gly Asp Gly Ser Asp 
1165 1170 li75 

GAG GGC GAG CTC TGT GAC CAG TGT TCT CTG AAT AAT GGT GGC TGT AGT 4023 
Glu Gly Glu Leu Cys Asp Gin Cys Ser Leu Asn Asn Gly Gly Cys Ser 
1180. 1185 \ 1190 

CAC AAC TGC TCA GTG GCC CCT GGT GAA GGC ATC GTG TGC TCT TGC CCT 4071 
His Asn Cys Ser Val Ala Pro Gly Glu Gly lie Val Cys Ser Cys Pro 
1195 1200- 1205 1210 

CTG GGC ATG GAG CTG GGC TCT GAC AAC CAC ACC TGC CAG ATC CAG AGC 411*9 
Leu Gly Met Glu Leu Gly Ser Asp Asn His Thr Cys Gin He Gin Ser 
1215 1220 1225 

TAC TGT GCC AAG CAC CTC AAA TGC AGC CAG AAG TGT GAC CAG AAC AAG 4167 
Tyr Cys Ala Lys His Leu Lys Cys Ser Gin Lys Cys Asp Gin Asn Lys 
1230 1235 1240 

TTC AGT GTG AAG TGC TCC TGC TAC GAG GGC TGG GTC TTG GAG CCT GAC 4215 
Phe Ser Val Lys Cys Ser Cys Tyr Glu Gly Trp Val Leu Glu Pro Asp 
1245 1250 1255 

GGG GAA ACG TGC CGC AGT CTG GAT CCC TTC AAA CTG TTC ATC ATC TTC 4263 
Gly Glu Thr Cys Arg Ser Leu Asp Pro Phe Lys Leu Phe He He Phe 
1260 1265 1270 

TCC AAC CGC CAC GAG ATC AGG CGC ATT GAC CTT CAC AAG GGG GAC TAC 4311 
Ser Asn Arg His Glu He Arg Arg He Asp Leu His Lys Gly Asp Tyr 
1275 1280 1285 - ' 1290 

AGC GTC CTA GTG CCT GGC CTG CGC AAC ACT ATT GCC CTG GAC TTC CAC 4359 
Ser Val Leu Val Pro Gly Leu Arg Asn Thr He Ala Leu Asp Phe His 
1295 1300 1305 



CTC AGC CAG AGT GCC CTC TAC TGG ACC GAC GCG GTA GAG GAC AAG ATC 
Leu Ser Gin Ser Ala Leu Tyr Trp Thr Asp Ala Val Glu Asp Lys He 
1310 1315 1320 



4407 



TAC CGT GGG AAA CTC CTG GAC AAC GGA GCC CTG ACC AGC TTT GAG GTG 
Tyr Arg Gly Lys Leu Leu Asp Asn Gly Ala Leu Thr Ser Phe Glu Val* 
1325 1330 1335 



4455 



GTG ATT CAG TAT GGC TTG GCC ACA CCA GAG GGC CTG GCT GTA GAT TGG 
Val lie Gin Tyr Gly Leu Ala Thr Pro Glu Gly Leu Ala Val Asp Trp 
1340 1345 1350 • 



4503 



ATT GCA GGC AAC ATC TAC TGG GTG GAG AGC AAC CTG GAC CAG ATC GAA 4551 
He Ala Gly Asn He Tyr Trp Val Glu Ser Asn Leu Asp Gin He Glu 
1355 1360 1365 " 1370 

GTG GCC AAG CTG GAC GGA ACC CTC CGA ACC ACT CTG CTG GCG GGT GAC 4599 
Val Ala Lys Leu Asp Gly Thr Leu Arg Thr Thr Leu Leu Ala Gly Asp 
1375 1380 1385 



ATT GAG CAC CCG AGG GCC ATC GCT CTG GAC CCT CGG GAT GGG ATT CTG 
He Glu His Pro Arg Ala He Ala Leu Asp Pro Arg Asp Gly He Leu 
1390 1395 1400 



4647 
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TTT TGG ACA GAC TGG GAT GCC AGC CTG CCA CGA ATC GAG GCT GCA TCC 4695 
Phe Trp Thr Asp Trp Asp Ala Ser Leu Pro Arg He Glu Ala Ala Ser 
1405 1410 1415 

ATG AGT GGA GCT GGC CGC CGA ACC ATC CAC CGG GAG ACA GGC TCT GGG 4743 
Met Ser Gly Ala Gly Arg Arg Thr He His Arg Glu Thr Gly Ser Gly 
1420 1425 1430 

GGC TGC GCC AAT GGG CTC ACC GTG GAT TAC CTG GAG AAG CGC ATC CTC 4791 
Gly Cys Ala Asn Gly Leu Thr Val Asp Tyr Leu Glu Lys Arg He Leu 
1435 1440* 1445 ^ 1450 

TGG ATT GAT GCT AGG TCA GAT GCC ATC TAT TCA GCC CGG TAT GAC GGC 4839 
Trp He Asp Ala Arg Ser Asp Ala He Tyr Ser Ala Arg Tyr Asp Gly 
1455 1460 - 1465 

TCC GGC CAC ATG GAG GTG CTT CGG GGA CAC GAG TTC CTG TCA CAC CCA 4887 
Ser Gly His Met Glu Val Leu Arg Gly His Glu Phe Leu Ser His Pro 
1470 1475 1480 

TTT GCC GTG ACA CTG TAC GGT GGG GAG GTG TAC TGG ACC GAC TGG CGA 4935 
Phe Ala Val Thr Leu Tyr Gly Gly Glu Val Tyr Trp Thr Asp Trp Arg 
1485 1490 1495 

ACA AAT ACA CTG GCT AAG GCC AAC AAG TGG ACT GGC CAC AAC GTC ACC 4983 
Thr Asn Thr Leu Ala Lys Ala Asn Lys Trp Thr Gly His Asn Val Thr 
1500 m 1505 1510 

GTG GTA CAG AGG ACC AAC ACC CAG CCC TTC GAC CTG CAG GTG TAT CAC 5031 
Val Val Gin Arg Thr Asn Thr Gin Pro Phe Asp Leu Gin Val Tyr His 
1515 1520 1525 1530 

CCT TCC CGG CAG CCC ATG GCT CCA AAC CCA TGT GAG GCC AAT GGC GGC 5079 
Pro Ser Arg Gin Pro Met Ala Pro Asn Pro Cys Glu Ala Asn Gly Gly 
1535 1540 1545 

CGG GGC CCC TGT TCC CAT CTG TGC CTC ATC AAC TAC AAC CGG ACC GTC 5127 • 
Arg Gly Pro Cys Ser His Leu Cys Leu He Asn Tyr Asn Arg Thr Val 
1550 1555 1560 

TCC TGG GCC TGT CCC CAC CTC ATG AAG CTG CAC AAG GAC AAC ACC ACC 5175 
Ser Trp Ala Cys Pro His Leu Met Lys Leu His Lys Asp Asn Thr Thr 
1565 1570 1575 

TGC TAT GAG TTT AAG AAG TTC CTG CTG TAC GCA CGT CAG ATG GAG ATC 5223 
Cys Tyr Glu Phe Lys Lys Phe Leu Leu Tyr Ala Arg Gin Met Glu He 
1580 1595 1590 

CGG GGC GTG GAC CTG GAT GCC CCG TAC TAC AAT TAT ATC ATC TCC TTC 5271 
Arg Gly Val Asp Leu Asp Ala Pro Tyr Tyr Asn Tyr He lie Ser Phe 
1595 1600 1605 1610 

ACG GTG CCT GAT ATC GAC AAT GTC ACG GTG CTG GAC TAT GAT GCC CGA . 5319 
Thr Val Pro Asp He Asp Asn Val Thr Val Leu Asp Tyr- Asp Ala Arg 
1615 1620 1625 

GAG CAG CGA GTT TAC TGG TCT GAT GTG CGG ACT CAA GCC ATC AAA AGG 5367 
Glu Gin Arg Val Tyr Trp Ser Asp Val Arg Thr Gin Ala He Lys Arg 
1630 1635 1640 
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GCA TTT ATC AAC GGC ACT GGC GTG GAG ACC GTT GTC TCT GCA GAC TTG 
Ala Phe lie Asn Gly Thr Gly Val Glu Thr Val Val Ser Ala Asp Leu 
1645 1650 1655 



5415 



CCC AAC GCC CAC GGG CTG GCT GTG GAC TGG GTC TCC CGA AAT CTG TTT 
Pro Asn Ala His Gly* Leu Ala Val Asp Trp Val Ser Arg Asn Leu Phe 
1660 1665 1670 



5463 



TGG ACA AGT TAC GAC ACC AAC AAG AAG CAG ATT AAC GTG GCC CGC CTG 
Trp Thr Ser Tyr Asp Thr Asn Lys Lys Gin lie Asn Val Ala Arg Leu 
1675 1680 ' 1685 1690 



5511 



GAC GGC TCC TTC AAG AAT GCG GTG GTG CAG GGC CTG GAG CAG CCC CAC 
Asp Gly Ser Phe Lys Asn Ala Val Val Gin Gly Leu Glu Gin Pro His 
1695 1700 1705 



5559. 



GGC CTG GTC GTC CAC CCG CTT CGT GGC AAG CTC TAC TGG ACT GAT GGG 
Gly Leu Val Val His Pro Leu Arg Gly Lys Leu Tyr Trp Thr Asp Gly 
1710 1715 1720 



5607 



GAC AAC ATC AGC ATG GCC AAC ATG GAT GGG AGC AAC CAC ACT CTG CTC 
Asp Asn He Ser Met Ala Asn Met Asp Gly Ser Asn His Thr Leu Leu 
1725 1730 17 35 



5655 



TTC AGT GGC CAG . AAG GGC CCT GTG GGG TTG GCC ATT GAC TTC CCT GAG 
Phe Ser Gly Gin Lys Gly Pro Val Gly Leu Ala He Asp Phe Pro Glu 
1740 1745 1750 



5703 



AGC AAA CTC TAC TG<3 ATC AGC TCT GGG AAC CAC ACA ATC AAC CGT TGC 
Ser Lys Leu Tyr Trp He Ser Ser Gly Asn His Thr lie Asn Arg Cys 
1755 1760 1765 1770 



5751 



AAT CTG GAT GGG AGC GAG CTG GAG GTC ATC GAC ACC ATG CGG AGC CAG 
Asn Leu Asp Gly Ser Glu Leu Glu Val He Asp Thr Met Arg Ser Gin 
1775 i780 1785 



5799 



CTG GGC AAG GCC ACT GCC CTG GCC ATC ATG GGG GAC AAG CTG TGG TGG 
Leu Gly Lys Ala Thr Ala Leu Ala He Met Gly Asp Lys Leu Trp Trp 
1790 1795 1800 



5847 . 



GCA GAT CAG GTG TCA GAG AAG ATG GGC ACG TGC AAC AAA GCC GAT GGC 
Ala Asp Gin Val Ser Glu Lys Met Gly Thr Cys Asn Lys Ala Asp Gly 
1805 1810 1815 



5895 



TCT GGG TCC GTG GTG CTG CGG AAC AGT ACC ACG TTG GTT ATG CAC ATG 
Ser Gly Ser Val Val Leu Arg Asn Ser Thr Thr Leu Val Met His Met 
1820 1825 1830 



5943 



AAG GTG TAT GAC GAG AGC ATC CAG CTA GAG CAT GAG GGC ACC AAC CCC 5991 
Lys Val Tyr Asp Glu Ser He Gin Leu Glu His Glu Gly Thr Asn Pro 
1835 1840 1845 1850 

TGC AGT GTC AAC AAC GGA GAC TGT TCC CAG CTC TGC CTG CCA ACA TCA 6039 
Cys Ser Val Asn Asn Gly Asp Cys Ser Gin Leu Cys Leu Pro Thr Ser 
1855 1860 1865 



GAG ACG ACT CGC TCC TGT ATG TGT ACA GCC GGT TAC AGC CTC CGG AGC 
Glu Thr Thr Arg Ser Cys Met Cys Thr Ala Gly Tyr Ser Leu Arg, Ser 
1870 1875 1880 



6087 



FflGn 12A 



WO 01/92474 PCT/US01/18041 

29/91 



GGA CAG CAG GCC TGT GAG GGT GTG GGC TCT TTT CTC CTG TAG TCT GTA 6135 
Gly Gin Gin Ala Cys Glu Gly Val Gly Ser Phe Leu Leu Tyr Ser Val 
1885 1890 1895 

CAT GAG GGA ATT CGG GGG ATT CCA CTA GAT CCC AAT GAC AAG TCG GAT 6183 
His Glu Gly He Arg Gly He Pro Leu Asp Pro Asn Asp Lys Ser Asp 
1900 1905 1910 

GCC CTG GTC CCA GTG TCC GGA ACT TCA CTG GCT GTC GGA ATC GAC TTC 6231 
Ala Leu Val Pro Val Ser Gly Thr Ser Leu Ala Val Gly lie Asp Phe 
1915 1920 1925 1930 

CAT GCC GAA AAT GAC ACT ATT TAT TGG GTG GAT ATG GGC CTA AGC ACC 6279 
His Ala Glu Asn Asp Thr He Tyr Trp Val Asp Met Gly Leu Ser Thr 
1935 1940 1945 

ATC AGC AGG GCC AAG CGT GAC CAG ACA TGG CGA GAG GAT GTG GTG ACC 6327 
He Ser Arg Ala Lys Arg Asp Gin Thr Trp Arg Glu Asp Val Val Thr 
1950 1955 I960 

AAC GGT ATT GGC CGT GTG GAG GGC ATC GCC GTG GAC TGG ATC GCA GGC 6375 
Asn Gly He Gly Arg Val Glu Gly He Ala Val Asp Trp He Ala Gly 
1965 1970 1975 

AAC ATA TAC TGG ACG GAC CAG GGC TTC GAT GTC ATC GAG GTT GCC CGG 6423 
Asn He Tyr Trp Thr Asp Gin Gly Phe Asp Val He Glu Val Ala Arg 
1980 1985 1990 

CTC AAT GGC TCT TTT CGT TAT GTG GTC ATT TCC CAG GGT CTG GAC AAG 6471 
Leu Asn Gly Ser Phe Arg Tyr Val Val He Ser' Gin Gly Leu Asp Lys 
19 95 2000 2005 2010 

CCT CGG GCC ATC ACT GTC CAC CCA GAG AAG GGG TAC TTG TTC TGG ACC 6519 
Pro Arg Ala He Thr Val His Pro Glu Lys Gly Tyr Leu Phe Trp Thr 
2015 2020 2025 

GAG TGG GGT CAT TAC CCA CGT ATT GAG CGG TCT CGC CTT GAT GGC ACA 6567 
Glu Trp Gly His Tyr Pro Arg He Glu Arg Ser Arg Leu Asp Gly Thr 
2030 2035 2040 

GAG AGA GTG GTG TTG GTT AAT GTC AGC ATC AGC TGG CCC AAT GGC ATC 6615 
Glu Arq Val Val Leu Val Asn Val Ser lie Ser Trp Pro Asn Gly He 
2045 2050 2055 

TCA GTA GAC TAT CAG GGC GGC AAG CTC TAC TGG TGT GAT GCT CGG ATG 6663 
Ser Val Asp Tyr Gin Gly Gly Lys Leu Tyr Trp Cys Asp Ala Arg Met 
2060 & " 2065 2070 

GAC AAG ATC GAG CGC ATC GAC CTG GAA ACG GGC GAG AAC CGG GAG GTG 6711 
Asp Lys He Glu Arg He Asp Leu Glu Thr Gly Glu Asn Arg Glu Val 
2075 2080 2085 2090 

GTC CTG TCC AGC AAT AAC ATG GAT ATG TTC TCC GTG TCC GTG TTT GAG 6759 
Val Leu Ser Ser Asn Asn Met Asp Met Phe Ser Val Ser yal Phe Glu 
2095 2100 2105 

GAC TTC ATC TAC TGG AGT GAC AGA ACT CAC GCC AAT GGC TCC ATC AAG 6807 
Asp Phe He Tyr Trp. Ser Asp Arg Thr His Ala Asn Gly Ser He Lys 
2110 2115 2120 
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6951 



S« ??S l G ° ^ t AC 5* T ?* ACA GAC TCC GTG CCT CTG AGG ACA GGC 6855 
Arg Gly Cys Lys Asp Asn Ala Thr Asp Ser Val Pro Leu Arg Thr Gly 
2125 2130 2135 

ATT GGT GTT CAG CTT AAA GAC ATC AAG GTC TTC AAC AGG GAC AGG CAG 6903 
ln LeU hys As P Ile L y* Val Phe Asn Arg Asp Arg Gin 
2140 2145 2150 

AAG GGT ACC AAT GTG TGC GCG GTA GCC AAC GGC GGG TGC CAG CAG CTC 
Lys Gly Thr Asn Val Cys Ala Val Ala Asn Gly Gly Cys Gin Gin Leu 
2155 2XSQ. 2165 2170 

TGC TTG TAT CGG GGT GGC GGA CAG CGA GCC TGT GCC TGT GCC CAC GGG 6999 
Cys Leu Tyr Arg Gly Gly Gly Gin Arg Ala Cys Ala Cys Ala His Gly 
2175 2160 2185 

ATG CTG GCA GAA GAC GGG GCC TCA TGC CGA GAG TAC GCT GGC TAC CTG 7047 
Met Leu Ala Glu Asp Gly Ala Ser Cys Arg Glu Tyr Ala Gly Tyr Leu 
2190 2195 2200 

CTC TAC TCA GAG CGG ACC ATC CTC AAG AGC ATC CAC CTG TCG GAT GAG 709S 
Leu Tyr Ser Glu Arg Thr Ile Leu Lys Ser Ile His Leu Ser Asp Glu 
2205 2210 2215 

CGT AAC CTC AAC GCA CCG GTG CAG CCC TTT GAA GAC CCC GAG CAC ATG 7143 
Arg Asn Leu Asn Ala Pro Val Gin Pro Phe Glu Asp Pro Glu His Met 
2220 2225 2230 

AAA AAT GTC ATC GCC CTG GCC TTT GAC TAC CGA GCA GGC ACC TCC CCG 7191 
Lys Asn Val lie Ala Leu Ala Phe Asp Tyr Arg Ala Gly Thr Ser Pro 
2235 2240 2 245 2250 

GGG ACC CCT AAC CGC ATC TTC TTC AGT GAC ATC CAC TTT GGG AAC ATC 7239 
Gly Thr Pro Asn Arg lie Phe Phe Ser Asp lie His Phe Gly Asn Ile 
2255 2260 2265 

CAG CAG ATC AAT GAC GAT GGC TCG GGC AGG ACC ACC ATC GTG GAA AAT 7287 
Gin Gin Ile Asn Asp Asp Gly Ser Gly Arg Thr Thr Ile Val Glu Asn 
2270 2275 2280 

GTG GGC TCT GTG GAA GGC CTG GCC TAT CAC CGT GGC TGG GAC ACA CTG 7335 
Val Gly Ser Val Glu Gly Leu Ala Tyr His Arg Gly Trp- Asp Thr Leu • 
2285 2290 2295 

TAC TGG ACA AGC TAC ACC ACA TCC ACC ATG ACC CGC CAC ACC GTG GAC 7383 
Tyr Trp Thr Ser Tyr Thr Thr Ser Thr Ile Thr Arg His Thr Val Asp 
2300 2305 2310 

CAG ACT CGC CCA GGG GCC TTC GAG AGG GAG ACA GTC ATC ACC ATG TCC 7431 
Gin Thr Arg Pro Gly Ala Phe Glu Arg Glu Thr Val lie Thr Met Ser 
2315 2320 2325 2330 

GGA GAC GAC CAC CCG AGA GCC TTT GTG CTG GAT GAG TGC CAG AAC CTG 7479 
Gly Asp Asp His Pro Arg Ala Phe Val Leu Asp Glu Cys Qln Asn Leu 
2335 2340 * 2345 

ATG TTC TGG ACC AAT TGG AAC GAG CTC CAT CCA AGC ATC ATG CGG GCA 7521 
Met Phe Trp Thr Asn Trp Asn Glu Leu His Pro Ser lie Met Arg Ala 
2350 2355 2360 
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GCC CTA TCC GGA GCC AAC GTC CTG Jver r?p n** /■■* _ _ 
Ala Leu Ser Gly Ala Asn Val ?hr 2S tF ^ ^ G GAC ATC CGC 
2365 o*™ U Ile Glu ^ As P Ile 

2375 

SS So XI S? S?S S S K £° g 6 !T MS CTG TAG TTC 
2380 ' %tpc P His Ar 9 Ala Glu.Lys Leu Tyr Phe 

2385 : 2390 

sS £S S ?£ 2S So £ G ^ G CGC tgcgag tac gac «° ** 

2 Ser Asp Ala Thr Leu Asp Lys lie Glu Arg Cys Glu Tyr Asp Gly Ser 

2405 2410 

35 So Ivl 2? ?S ? TA J* 0 TC6 GAG 000 GTC <*C CCC TTT 66G TTG 
His Arg Tyr Val Leu Lys Ser Glu Pro Val His Pro Phe Gly Leu 

2415 2420 2425 

X 51? 25 S £5 25 a ? t oJ c tgg act gac tgg gtg cgg 066 «* 

Ala Val Tyr Gly Glu His lie Phe Trp Thr Asp Trp Val Arg Arg Ala 

2435 2440 

vlt Sn SS S f 0 J* ^ T GTG GGC AGC GAC ATG *W CTG CTT CGG 
Val Gin Arg Ala Asn Lys Tyr Val Gly Ser Asp Met Lys Leu Leu Arg 

2^50 • 2455 

SI? a«« r? T d CC ^ ^ CCC ATG GGC ATC MC GCC GTG GCC AAT GAC 

2460 ° Gln 6l *,!£ W6t Gly lle lle Ala v *l Ala Asn Asp 

2460 2465 2470 

? C r JS? AGC J" GAA f TC TCC CCC TGC CGT ATC AAC AAT GGA GGC TGC 
Thr Asn Ser Cys Glu Leu Ser Pro Cys Arg lie Asn Asn Gly Gly Cys 
2480 2485 2490 

CAG GAT CTG TGT CTG CTC ACC CAC CAA GGC CAC GTC AAC TGT TCC TGT 
Gin Asp Leu Cys Leu Leu Thr His Gin Gly His Val Asn Cys Ser Cys 
2495 2500 2505 

Ha Til ??S ? G t? C CT ° £ AG GAG GAC TTC ACC TGC CGG GCT GTG AAC 
Arg Gly Gly Arg Ile Leu Gin Glu Asp Phe Thr Cys Arg Ala Val Asn 
2510 2515 2520 

III Vt T ? G ?? A ^ S** GAG TTT GAG T6T GCC AAT GGG GAA TGT 
Ser Ser Cys Arg Ala Gin Asp Glu Phe Glu Cys Ala Asn Gly Glu Cys • 
2525 2530 2 535 

^ 11° f C CTC AC ° TGT GAT GGC GTC TCC CAC TGC AAG GAC AAG 
%f!E " L6U Thr . Cy f Asp G1 * Val Ser His Cys Lys Asp Lys 

2540 2545 2 5S0. - 

ISS til r A f f* 6 o CC I CC TAC T6C M C TCA CGC CGC TGC AAG AAG ACT 
25S5 P 73 2560 ^ ^ ly * Z 570 

TTC CGC CAG TGT AAC AAT GGC CGC TGT GTA TCC AAC ATG CTG TGG TGC 
Phe Arg Gin Cys Asn Asn Gly Arg Cys Val Ser Asn Met Leu Trp Cys 
2575 2580 2585 

AAT GGG GTG GAT TAC TGT GGG GAT GGC TCT GAT GAG ATA CCT TGC AAC 
Asn Gly Val Asp Tyr Cys Gly Asp Gly Ser Asp Glu Ile Pro Cys Asn 
2590 2595 2600 
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AAG ACT GCC TGT GGT GTG GGT GAG TTC CGC TGC CGG GAT GGG TCC TGC 8295 
Lys Thr Ala Cys Gly Val Gly Glu Phe Arc Cys Arg Asp Gly Ser Cys 
2605 2610 "2615 

ATC GGG AAC TCC AGT. CGC TGC, AAC CAG TTT GTG GAT TGT GAG GAT GCC 8343 
lie Gly Asn Ser Ser Arg. Cys Asn Gin Phe Val Asp Cys Glu Asp Ala 
2620 2625 2630 

TCG GAT GAG ATG AAT TGC AGT GCC ACA GAC TGC AGC AGC TAT TTC CGC 8391 ' 
Ser Asp Glu Met Asn Cys Ser Ala Thr Asp Cys Ser Ser Tyr Phe Arg 
2635 2640. 2645 2650 

CTG GGC GTG AAA GGT GTC CTC TTC CAG CCG TGC GAG CGG ACA TCC CTG 8439 
Leu Gly Val Lys Gly Val Leu Phe Gin Pro Cys Glu Arg Thr Ser Leu 
2655 2660 ~ 2665 

TGC TAC GCA CCT AGC TGG GTG TGT GAT GGC GCC AAC GAC TGT GGA GAC 8487 
Cys Tyr Ala Pro Ser Trp Val Cys Asp Gly Ala Asn Asp Cys Gly Asp 
2670 2675 2680 

TAC AGC GAT GAA CGT GAC TGT CCA GGT GTG AAG CGC CCT AGG TGC CCG 8535 
Tyr Ser Asp Glu Arg Asp Cys Pro Gly Val Lys Arg Pro Arg Cys Pro 
2685. 2690 2695 

CTC AAT TAC TTT GCC TGC CCC AGC GGG CGC TGT ATC CCC ATG AGC TGG 8583 
Leu Asn Tyr Phe Ala Cys Pro Ser Gly Arg Cys He Pro Met Ser Trp 
2700 2705 2710 

ACG TGT GAC AAG GAG GAT GAC TGT GAG AAC GGC GAG GAT GAG ACC CAC 8631 
Thr Cys Asp Lys Glu Asp Asp Cys Glu Asn Gly Glu Asp Glu Thr His 
2715 2720 2725 2730 

TGC AAC AAG TTC TGC TCA GAG GCA CAG TTC. GAG TGC CAG AAC CAC CGG 8679 
Cys Asn Lys Phe Cys Ser Glu Ala Gin Phe Glu Cys Gin Asn His Arg 
2735 2740 2745 

TGT ATC TCC AAG CAG TGG CTG TGT GAC GGT AGC GAT GAT TGC GGG GAT 8727 , 
Cys He Ser Lys Gin Trp Leu Cys Asp Gly Ser Asp Asp Cys Gly Asp 
2750 2755 2760 



GGC TCC GAT GAG GCA GCT CAC TGT GAA GGC AAG ACA TGT GGC CCC TCC 
Gly Ser Asp Glu Ala Ala His Cys Glu Gly Lys Thr Cys Gly Pro Ser* 
2765 2770 2775 



8775 



TCC TTC TCC TGT CCC GGC ACC CAC GTG TGT GTC CCT GAG CGC TGG CTC 8823 
Ser Phe Ser Cys Pro Gly Thr His Val Cys Val Pro Glu Arg Trp Leu 
2780 2785 2790 

TGT GAT GGC GAC AAG GAC TGT ACC GAT GGC GCG GAT GAG AGT GTC ACT 8871 
Cys Asp Gly Asp Lys Asp Cys Thr Asp Gly Ala Asp Glu Ser Val Thr 
2795 2800 2805 2810 

GCT GGC TGC CTG TAC AAC AGC ACC TGT GAT GAC CGT GAG TTC ATG TGC B919 
Ala Gly Cys Leu Tyr Asn Ser Thr Cys Asp Asp Arg Glu .Phe Met Cys 
2815 2820 2825 



CAG AAC CGC TTG TGT ATT CCC AAG CAT TTC GTG TGC GAC CAT GAC CGT 
Gin Asn Arg Leu Cys He Pro Lys His Phe Val Cys Asp His Asp Arg 
2830 2835 2840 
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GAC TGT GCT GAT GGC TCT GAT GAA TCC CCT GAG TGT GAG TAC CCA ACC 9015 
.Asp Cys Ala Asp Gly Ser Asp Glu Ser Pro Glu Cys Glu Tvr Pro Thr 
2845 2850 2 855 

TGC CGG CCC AAT GAA TTC CGC TGT GCC AAT GGG CGT TGT CTG AGC TCC 9063 
Cys Gly Pro Asn Glu Phe Arg Cys Ala Asn Gly Arg Cys Leu Ser Ser 
2860 2865 2870 

CGT CAG TGG GAA TGT GAT GGG GAG AAT GAC TGT CAC GAC CAC AGC GAT 9m 
Arg Gin Trp Glu Cys Asp Gly Glu Asn Asp Cys His Asp His Ser Asp 
2875 2880. 2885 2890 

GAG GCT CCC AAG AAC CCA CAC TGC ACC AGC CCA GAG CAC AAA TGC AAT 9159 
Glu Ala Pro Lys Asn Pro His Cys Thr Ser Pro Glu His Lys Cys Asn 
2895 2900 2905 

GCC TCA TCA CAG TTC CTG TGC AGC AGC GGG CGC TGC GTG GCT GAG GCG 9207 
Ala Ser Ser Gin Phe Leu Cys Ser Ser Gly Arg Cys Val Ala Glu Ala 
2910 2915 2920 

TTG CTC TGC AAC GGC CAG GAC GAC TGT GGG GAC GGT TCA GAC GAA CGC 9255 
Leu Leu Cys Asn Gly Gin Asp Asp Cys Gly Asp Gly Ser Asp Glu Arg 
2925 2930 2935 

GGG TGC CAT GTC AAC GAG TGT CTC AGC CGC AAG CTC AGT GGC TGC AGT 9303 
Gly Cys His Val Asn Glu Cys Leu Ser Arg Lys Leu Ser Gly Cys Ser 
2940 2945 2950 

CAG GAC TGC GAG GAC CTC AAG ATA GGC TTT AAG TGC CGC TGT CGC CCG 9351 
Gin Asp Cys Glu Asp Leu Lys He Gly Phe Lys Cys Arg Cys Arg Pro 
2955 2960 2965 2970 

GGC TTC CGG CTA AAG GAC GAT GGC AGG ACC TGT GCC GAC CTG GAT GAG 9399 
Gly Phe Arg Leu Lys Asp Asp Gly Arg Thr Cys Ala Asp Leu Asp Glu 
2975 2980 2985 

TGC AGC ACC ACC TTC CCC TGC AGC CAG CTC TGC ATC AAC ACC CAC GGA 9447 • 
Cys Ser Thr Thr Phe Pro Cys Ser Gin Leu Cys He Asn Thr His Gly 
2990 2995 3000 

» 

AGT TAC AAG TGT CTG TGT GTG GAG GGC TAT GCA CCC CGT GGC GGT GAC 9495 
Ser Tyr Lys Cys Leu Cys Val Glu Gly Tyr Ala Pro Arg Gly Gly Asp- 
3005 3010 3015 

CCC CAC AGC TGC AAA GCT GTG ACC GAT GAG GAG CCA TTT CTC ATC TTT 9543 
Pro His Ser Cys Lys Ala Val Thr Asp Glu Glu Pro Phe Leu He Phe 
3020 3025 3030 

GCC AAC CGG TAC TAC CTG CGG AAG CTC AAC CTG GAC GGC TCC AAC TAC 9591 
Ala Asn Arg Tyr Tyr Leu Arg Lys Leu Asn Leu Asp Gly Ser Asn Tyr 
3035 3040 3045 3050 

ACA CTG CTT AAG CAG GGC CTG AAC AAT GCG GTC GCC TTG GCA TTT GAC 9639 
Thr Leu Leu Lys Gin Gly Leu Asn Asn Ala Val Ala Leu Ala Phe Asp 
3055 3060 * 3065 

TAC CGA GAG CAG AT<5 ATC TAC TGG ACG GGC GTG ACC ACC CAG GGC AGC 9687 
Tyr Arg Glu Gin Met He Tyr Trp Thr Gly Val Thr Thr Gin Gly Ser 
3070 3075 308O 
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ATG ATT CGC AGG ATG CAC rrn 

Met lie Arg Arg Met His £1° ?° ^ A6C MC 616 GTT «« CAC 

3085 9 30oo Gly Ser Asn Val Gin Va * leu His 

JQ90 3095 

CG6 ACG GGC CTT AGT AAP <->r» _ 

oly u. s„ £ £ S 25 S !2 S 5 5 5 

3105 3110 



GGC AAC CTG TAC TGG TGT GAC AAG GGC AGA GAT ACC 
Gly Asn Leu Tyr ftp Cys Asp Lys Gly Arg As P £ 
3120 • 3125 

AAG CTT AAC GGG GCC TAT CGG ACA GTG CTG GTC AGC 
Lys Leu Asa Gly Ala Tyr Arg Thr Val Leu Val Ser 
3135 3140 

GAG CCC AGA GCT CTG GTA GTG GAT GTA CAG AAT err 
Glu Pro Arg Ala Leu Val Val Asp Si g£ J£ g 
3150 3155 



ATT GAG GTG TCC 
He Glu Val Ser 
3130 

TCT GGC CTC CGG 
Ser Gly Leu Arg 
3145 

TAC CTG TAC TGG 
Tyr Leu Tyr Trp 
3160 



S SjJ Z Z S S 2 S S 5 S§ S 5 3 
= JS - = S fig £££ SB 3 £ S S 

CTG ACC GTG GAC TAC GTC 
Leu Thr Val Asp Tyr Val 
3195 3200 



GAG GAC TAC ATC GAG TTC 
Glu Asp Tyr lie Glu Phe 
3215 

GTG CTG AGC CAA GAC ATC 
Val Leu Ser Gin Asp He 
3230 



ACG GAA CGC ATC TAC TGG GCT GAC GCC CGT 
Thr Glu Arg He Tyr Trp Ala Asp Ala Arg 
3205 * 3210 

GCC AGC CTG GAT GGC TCC AAC CGT CAC GTT 
Ala Ser Leu Asp Gly Ser Asn Arg His Val 
3220 3225 

CCA CAC ATC TTT GCG CTG ACC CTA TTT GAA 
Pro Hxs He Phe Ala Leu Thr Leu Phe Glu 
323 5 3240 

as P 2S 55 JE S «£ SE 5 S* !? TCC ATC "* CGG GCC 

«s P Tyrval Tyr Trp Thr Asp^Trp Glu Thr Lys Ser He Asn Arg Ala 

3255 

25 Lys Thr ?£ Glv S J* CTC CTC ATC AGC ACC CTG «C 

3260 7 LyS Thr Leu Leu 116 Ser «« Leu His 

o^bS 3270 ■ 

s s - e ss ss a sss 

3280 3285 3290 

Pro j£ So 55 Lvs SI? ?° f T GGT GGG TGC AGC CTG TGC 
Pro Asn Hxs Pro Cys Lys Val Asn Asn Gly Gly Cys Ser Asn Leu Cys 

3295 3300 ' 3305 

CTG CTG TCC -CCT GGG GGT GGT CAC AAG TGC GCC TGC CCC ACC AAC TTC 
Leu Leu Ser Pro Gly Gly Gly His Lys Cys Ala lf a Pro £ is"n Z 
3310 331 5 3320 
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•55 SS SJ SS 2S X ? T 8? TGT GTG TCC *>* ™C ACA GCA AGC 

3325 7 P y Cys Val Ser <* s *hr Ala Ser 

- " 3330 3335 

Gil III SI? r G ° ?** ^ GAC ^ TGC ATC CCC ™ TG6 TGG AAG TGT 
* 3345 33so • 

i? ^ G « AC f C TGT GGG GAT CAC TCA GAG CCT CCA GAC TGT 
Asp Thr Glu Asp Asp Cys Gly Asp His Ser Asp Glu Pro Pro Asp Cys 

3355 3360 . 3365 3370 

CCC GAG TTC AAG TGC CGC CCA GGC CAG TTC CAG TGC TCC ACC GGC ATC 
Pro Glu Phe Lys Cys Arg Pro Gly Gin Phe Gin Cys Ser Thr Giy lie 
3375 3380 338 | 

TGC ACC AAC CCT GCC TTC ATC TGT GAT GGG GAC AAT GAC TGC CAA GAC 
Cys Thr ton Pro Ala Phe lie Cys Asp Gly Asp Asn Asp Cys Gin Asp 
3390 3395 3400 

AAT AGT GAC GAG GCC AAT TGC GAC ATT CAC GTC TGC TTG CCC AGC CAA 
Asn Ser Asp Glu Ala Asn Cys Asp He His Val Cys Leu Pro Ser Gin 
3405 3410 3415 

TTC AAG TGC ACC AAC ACC AAC CGC TGC ATT CCT GGC ATC TTC CGT TGC 
Phe Lys Cys Thr Asn Thr Asn Arc, Cys He Pro Gly He Phe Arg Cys 
3420 3425 3430 

AAT GGG CAG GAC AAC TGC GGG GAC GGC GAG GAT GAG CGG GAT TGC CCT 
Asn Gly Gin Asp Asn Cys Gly Asp Gly Glu Asp Glu Arg Asp Cys Pro 
3435 3440 3445 3450 

r fP. v TG ACC 2 GC GCC CCC A* 0 CAG TTC CAG TGC T CC ATC ACC AAG CGC 
Glu Val Thr Cys Ala Pro Asn Gin Phe Gin Cys Ser He Thr Lys Arg 
34 55 3460 3465 

TGC ATC CCT CGC GTC TGG GTC TGT GAC AGG GAT AAT CAC TGT GTG GAC 
Cys He Pro Arg Val Trp Val Cys Asp Arg Asp Asn His Cys Val Asp 
3470 3475 348O 

GGC AGT GAT GAG CCT GCC AAC TGT ACC CAA ATG ACC TGT GGA GTG GAT 
Gly Ser Asp Glu Pro Ala Asn Cys Thr Gin Met Thr Cys Gly Val Asp 
3485 3490 3495 * 

GAG TTC CGC TGC AAG GAT TCT GGC CGC TGC ATC CCC GCG CGC TGG AAG 

•ifSn Ar ^ CyS LyS ASP Ser Arg Cys He Pro Ala Arg Trp Lys 
3500 3505 3510 

TGT GAC GGA GAA GAT GAC TGT GGG GAT GGT TCA GAT GAG CCC AAG GAA 
Cys Asp Gly Glu Asp Asp Cys Gly Asp Gly Ser Asp Glu Pro Lys Glu 
3515 3520 3525 3530 

GAG TGT GAT GAG CGC ACC TGT GAG CCA TAC CAG TTC CGC TGC AAA AAC 
Glu Cys Asp Glu Arg Thr Cys Glu Pro Tyr Gin Phe Arg Cys Lys Asn 
3535 3540 3545 

AAC CGC TGT GTC CCA GGC CGT TGG CAA TGT GAC TAC GAC AAC GAC TGC 
Asn Arg Cys Val Pro Gly Arg Trp Gin Cys Asp Tyr Asp Asn Asp Cys 
3S50 3555 356O 
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S 5£ sS % trz 3S S 5 2! s s as 

3570 3575 

stl SS S Ss aS f T ?f f C T6C ATC GCT GGG CGC TGG AAG 

3580 7 AIa 3 5ls Gly Ar9 C * S JJj «y Arg Trp Lys 

SI Asn Sv Asn H?I 2° ?* G f GAC GGC TCA GAC <*G AAA GAC TGC 
Cys Asp Gly Asp His Asp Cys Ala Asp Gly Ser Asp Glu Lys Asp Cys 

3999 3600 • 3605 3610 

ACC CCC CGC TGT GAT ATG GAC CAG TTC CAG TGC AAG AGT GGC CAC TGC 
Thr Pro Arg Cys Asp Met Asp Gin Phe Gin Cys tjs Ser Gly 25 Cys 
3515 3620 362S 

X So Su 2 J? P° G J" GAC GCG GM GCT GAC * GT ATG GAC GGC 
lie Pro Leu Arg Trp Pro Cys Asp Ala Asp Ala Asp Cys Met Asp Gly 

3630 3635 3640 

AGT GAC GAG GAA GCC TGT GGC ACT GGG GTG AGG ACC TGC CCA TTG GAT 
Ser Asp Glu Glu Ala Cys Gly Thr Gly Val Arg Thr Cys Pro 2! AsJ 
3645 3650 3655 

GAG TTT CAA TGT AAC AAC ACC TTG TGC AAG CCG CTG GCC TGG AAG TGT 

1 s2S yS ASn ASn , Thr Leu <*« *** Pr ° Leu Ala Trp Lys Cys 

■ ?66 ° 3665 3670 

Sn G G i SS A- GAC > CT J?° GAC ^ TCA GAT GAG MC CCC GAG GAA 
Asp Gly Glu Asp Asp Cys Gly Asp Asn Ser Asp Glu Asn Pro Glu Glu 

3675 3680 3685 3690 

SS S? C SZ ph° tT C CCT CCC MC 066 CCT TTC C6C ^ AAG AAT 

Cys Ala Arg Phe lie Cys Pro Pro Asn Arg Pro Phe Arg Cys Lys Asn 

369 * 3700 J 3705 

GAC CGA GTC TGC CTG TGG ATT GGG CGC CAG TGT GAT GGC GTG GAC AAC 
Asp Arg Val Cys Leu Trp lie Gly Arg Gin Cys Asp Gly Val Asp Asn 
3710 3715 3720 

TGT GGA GAT GGG ACT GAC GAG GAG GAC TGT GAG CCC CCC ACG GCC CAG 
Cys Gly Asp Gly Thr Asp Glu Glu Asp Cys Glu Pro Pro Thr Ala Gin 
3725 3730 3735 

AAC CCC CAC TGC AAA GAC AAG AAG GAG TTC CTG TGC CGA AAC CAG CGC 

HlS Cys tys ** p Lys L * s Glu Phe Leu Cys Arg Asn Gin Arg 
3740 3745 3 75o • 

TGT CTA TCA TCC TCC CTG CGC TGT AAC ATG TTC GAT GAC TGC GGC GAT 
Cys Leu Ser Ser Ser Leu Arg Cys Asn Met Phe Asp Asp Cys Gly Asp 
3755 37 *0 3765 3770 

GGC TCC GAT GAA GAA GAT TGC AGC ATC GAC CCC AAG CTG ACC AGC TGT 
Gly Ser Asp Glu Glu Asp Cys Ser lie Asp Pro Lys Leu Thr Ser Cys 
3 "S 3780 3785 

GCC ACC AAT. GCC AGC ATG TGT GGG GAC GAA GCT CGT TGT GTG CGC ACT 
Ala Thr Asn Ala Ser Met Cys Gly Asp Glu Ala Arg Cys Val Arg Thr 
379 ° 3795 * 3 8 oo 
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GAG AAA GCT GCC TAC TGT GCC TGC CGC TCG GGC TTC CAT ACT GTG CCG 11895 
Glu Lys Ala Ala Tyr Cys Ala Cys Arg Ser Gly Phe His Thr Val Pro 
3805 3810' 3815 

GGC CAG CCC GGA TGC CAG GAC ATC AAC GAG TGC CTG CGC TTT GGT ACC 11943 

Gly ^™ Pr ° Gly CyS Gln As P Ile Asn GIu C V S Arg Phe Gly Thr 
3820 3825 3830 

TGC TCT CAG CTC TGG AAC AAA CCC AAG GGA GGC CAC CTC TGC AGC TGT 11991 
Cys Ser Gin Leu Trp Asn Lys Pro Lys Gly Gly His Leu Cys Ser Cys 
3835 3 840 3845 3850 

GCC CGC AAC TTC ATG AAG ACA CAC AAC ACC TGC AAA GCT GAA GGC TCC 12039 
Ala Arg Asn Phe Met Lys Thr His Asn Thr Cys Lys Ala Glu Gly Ser 
38 55 3860 3865 

GAG TAC CAG GTG CTA TAC ATC GCG GAT GAC AAC GAG ATC CGC AGC TTG 12087 
Glu Tyr Gin Val Leu Tyr lie Ala Asp Asp Asn Glu Ile Arg Ser Leu 
3970 3875 3880 

TTC CCG GGC CAC CCC CAC TCA GCC TAC GAG CAG ACA TTC CAG GGC GAT 12135 
Phe Pro Gly His Pro His Ser Ala Tyr Glu Gin Thr Phe Gin Gly Asp 
3885 3890 3895 

GAG AGT GTC CGC ATA GAT GCC ATG GAT GTC CAT GTC AAG GCC GGC CGT 12183 
Glu Ser Val Arg lie Asp Ala Met Asp Val His Val Lys Ala Gly Arg 
3900 3905 3910 

GTC TAC TGG ACT AAC TGG CAC ACG GGC ACA ATC TCC TAC AGG AGC CTG 12231 
Val Tyr Trp Thr Asn Trp His Thr Gly Thr lie Ser Tyr Arg Ser Leu 
3915 3920 3925 3930 

CCC CCT GCC GCC CCT CCT ACC ACT TCC AAC CGC CAC CGG AGG CAG ATC 12279 
Pro Pro Ala Ala Pro Pro Thr Thr Ser Asn Arg His Ar^ Arg Gin Ile 
3935 3940 " 3945 

GAC CGG GGT GTC ACC CAC CTC AAT ATT TCA GGG CTG AAG ATG CCG AGG 12327 
Asp Arg Gly Val Thr His Leu Asn Ile Ser Gly Leu Lys Met Pro Arg 
3950 3955 3960 

GGT ATC GCT ATC GAC TGG GTG GCC GGG AAT GTG TAC TGG ACC GAT TCC 12375 
Gly lie Ala Ile Asp Trp Val Ala Gly Asn Val Tyr Trp Thr Asp Ser 
3965 3970 3975 

GGC CGA GAC GTG ATT GAG GTG GCG CAA ATG AAG GGC GAG AAC CGC AAG 12423 
Gly Arg Asp Val He Glu Val Ala Gin Met Lys Gly Glu Asn Arg Lys 
3980 3985 3990 

ACG CTC ATC TCG GGC ATG ATT GAT GAG CCC CAT GCC ATC GTG GTG GAC 12471 
Thr Leu He Ser Gly Met Ile Asp Glu Pro His Ala Ile Val Val Asp 
3 995 4000 4005 4010 

CCT CTG AGG GGC ACC ATG TAC TGG TCA GAC TGG GGG AAC CAC CCC AAG 12519 
Pro Leu Arg Gly Thr Met Tyr Trp Ser Asp Trp Gly Asa His Pro Lys 
4015 4020 * 4025 

ATT GAA ACA GCA GCG ATG GAT GGC ACC CTT CGG GAG ACT CTC GTG CAA 12567 
Ile Glu Thr Ala Ala Wet Asp Gly Thr Leu Arg Glu Thr Leu Val Gin 
4030 4035 4040 
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GAC AAC ATT CAG TGG CCT ACA GGG CTG GOT GTG GAC TAT CAC AAT GAA 
Asp Asn He Gin Trp Pro Thr^ly Leu Ala Va? Spjyr £s £n Glu 

S £r S S£ Aso S i* f* ™ ^ AT ° GGC A6C ATC CGG 
4060 y P P an« 78 LSU Ser V ^ Ue G1 y Ser a* **g 

. 4 °65 4070 

CTC AAC GGC ACT GAC CCC ATT GTG GCT GCT GAC AGC AAA CGA GGC CTA 
Leu Asn Gly Thr Asp Pro lie Val Ala Ala Asp Ser £yt 2g Sg 2 
4075 4080 4085 40 90 

Jer £5 Pro S X rf ?* P? TTT GA * GAC TAC ATC TAC GGA GrC 
Ser His Pro Phe Ser He Asp Val Phe Glu Asp Tyr He Tyr Gly Val 

,os,:> 4100 4ios 

JS Ty"r He JE iE 22 ?? ^ J** ATC CAC *«« TTT GGA CAC AGC 
Thr Tyr He Asn Asn Arg Val Phe Lys He His Lys Phe Gly His Ser 

1AO 4115 4120 

CCC TTG TAC AAC CTA ACT GGG GGC CTG AGC CAT GCC TCT GAT GTA GTC 
Pro Leu Tyr Asn Leu Thr Gly Gly Leu Ser His Ala ler Asp Sal Val 
4125 4130 4X35 

Zl Jtr Sil SK Sf 2" f G ^ GTG ACC CCC TGT G * C CGC 

i??? yS Gln Pr ° Glu Val Thr Asn P«> Cys Asp Arg 

* A * U 4145 41S0 

AAG AAA TGC GAA TGG CTG TGT CTG CTG AGC CCC AGC GGG CCT GTC TGC 
Lys Lys Cys Glu Trp Leu Cys Leu Leu Ser Pro Ser Gly Pro Val Cys 
4 * ss 4160 4i.6 5 417Q 

Thr III Pro r? A f* 5 AGG CTG GAT ** T GGC ACC * GT GTG CCT GTG 
Thr Cys Pro Asn Gly Lys Arg Leu Asp Asn Gly Thr Cys Val Pro Val 

4175 4180 4185 

CCC TCT CCA ACA CCC CCT CCA GAT GCC CCT AGG CCT GGA ACC TGC ACT 
Pro Ser Pro Thr Pro Pro Pro Asp Ala Pro Arg Pro Gly Thr Cys Thr 
4 190 4195 4200 

CTG- CAG TGC TTC AAT GGT GGT ACT TGT TTC CTC AAC GCT CGG AGG CAG 
Leu Gin Cys Phe Asn Gly Gly Ser Cys Phe Leu Asn Ala Arg Arg Gin 
4205 4210 4215 

CCC AAG TGC CGT TGC CAG CCC CGT TAC ACA GGC GAT AAG TGT GAG CTG 

yS Ar9 CyS Gln Pro *"» T * r Thr Gly Asp Lys Cys Glu Leu 
* zzo ' 4225 4230 

GAT CAG TGC TGG GAA TAC TGT CAC AAC GGA GGC ACC TGT GCG GCT TCC 
Asp Gln Cys Trp Glu Tyr Cys His Asn Gly Gly Thr Cys Ala Ala Ser 
4235 4240 4245 4250 

St III rf £ TG » CC ACG TGC CGC ^GT CCC ACT GGC TTC ACG GGC CCC 
Pro Ser Gly Met Pro Thr Cys Arg Cys Pro. Thr Gly Phe Thr Gly Pro 
4255 4260 * 4265 

AAA TGC ACC GCA CAG GTG TGT GCA GGC TAC TGC TCT AAC AAC AGC ACC 
Lys Cys Thr Ala Gln Val Cys Ala Gly Tyr Cys Ser Asn Asn Ser Thr 
4270 4275 4 2 80 
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TGC ACC GTC AAC CAG GGC AAC CAG CCC CAG TGC CGA TGT CTA CCT GGC 
Cys Thr Val Asn Gin Gly Asn Gin Pro Gin Cys Arg Cys Leu Pro Gly 
4285 4290 4295 



TTC CTG GGC GAC CGT TGC CAG TAC CGG CAG TGC TCT GGC TTC TGT GAG 
Phe Leu Gly Asp Arg Cys Gin Tyr Arg Gin Cys Ser Gly Phe Cys Glu 
4300 4305 ' 4310 . 

AAC TTT GGC ACC TGT CAG ATG GCT GCT GAT GGC TCC CGA CAA TGT CGC 
Asn Phe Gly Thr Cys Gin Met Ala Ala Asp Gly Ser Arg Gin Cys Arg 
4315 4320 . 4325 4330 

TGC ACC GTC TAC TTT GAG GGA CCA AGG TGT GAG GTG AAC AAG TGT AGT 
Cys Thr Val Tyr Phe Glu Gly Pro Arg Cys Glu Val Asn Lys Cys Ser 
4335 4340 4345 

CGC TGT CTC CAA GGC GCC TGT GTG GTC AAT AAG CAG ACC GGA GAT GTC 
Arg Cys Leu Gin Gly Ala Cys Val Val Asn Lys Gin Thr Gly Asp Val 
4350 4355 4360 

ACA TGC AAC TGC ACT GAT GGC CGG GTA GCC CCC AGT TGT CTC ACC TGC 
Thr Cys Asn Cys Thr Asp Gly Arg Val Ala Pro Ser Cys Leu Thr Cys 
4365 4370 4375 



13335 



13383 



13431 



13479 
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ATC GAT CAC TGT AGC AAT GGT GGC TCC TGC ACC ATG AAC AGC AAG ATG 
lie Asp His Cys Ser Asn Gly Gly Ser Cys Thr Met Asn Ser Lys Met 
4380 4385 4390 



13623 



ATG CCT GAG TGC CAG TGC CCG CCC CAT ATG ACA GGA CCC CGG TGC CAG 
Met Pro Glu Cys Gin Cys Pro Pro His Met Thr Gly Pro Arg Cys Gin 
4395 4400 . 4405 4410 



13671 



GAG CAG GTT GTT AGT CAG CAA CAG CCT GGG CAT ATG GCC TCC ATC CTG 
Glu Gin Val Val Ser Gin Gin Gin Pro Gly His Met Ala Ser lie Leu 
4415 4420 4425 



13719 



ATC CCT CTG CTG CTG CTT CTC CTG CTG CTT CTG GTG GCT* GGC GTG GTG 
He Pro Leu Leu Leu Leu Leu Leu Leu Leu Leu Val Ala Gly Val Val 
4430 4435 4440 



13767 



TTC. TGG TAT AAG CGG CGA GTC CGA GGG GCT AAG GGC TTC CAG CAC CAG 
Phe Trp Tyr Lys Arg Arg Val Arg Gly Ala Lys Gly Phe Gin His Gin 
4445 4450 4455 



13815 



CGG ATG ACC AAT GGG GCC ATG AAT GTG GAA ATT GGA AAC CCT ACC TAC 
Arg Met Thr Asn Gly Ala Met Asn Val Glu He Gly Asn Pro Thr Tyr 
4460 4465 4470 



13863 



AAG ATG TAT GAA GGT GGA GAG CCC GAT GAT GTC GGG GGC CTA CTG GAT 
Lys Met Tyr Glu Gly Gly Glu Pro Asp Asp Val Gly Gly Leu Leu Asp 
4475' 4480 4485 4490 



13911 



GCT GAT TTT GCC CTT GAC CCT GAC AAG CCT ACC AAC TTC ACC AAC CCA 
Ala Asp Phe Ala Leu Asp Pro Asp Lys Pro Thr Asn Phe Thr Asn Pro . 

4495 4500 * 4505 



13959 



GTG TAT GCC ACG CTC TAC ATG GGG GGC CAC GGC AGC CGC CAT TCC CTG 
Val Tyr Ala Thr Leu Tyr Met Gly Gly His Gly Ser Arg His Ser Leu 
4510 4515 4520 



14007 
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GCC AGC ACG GAC GAG AAG CGA GAA CTG CTG GGC CGG GGA CCT GAA GAC 
Ala Ser Thr Asp Glu Lys Arg Glu Leu Leu Gly Arg Gly Pro Glu Asp 
4525 4530 4535 



14055 



GAG ATA GGA GAT CCC TTG GCA TAGGGCCCTG CCCCGACGGA TGTCCCCAGA AAGC 14110 
CCCCTGCCAC ATGAGTCTTT CAATGAACCC CCTCCCCAGC CGGCCCfTCT CCGGCCCTGC 14170 
Glu lie Gly Asp Pro Leu Ala 
4540 4545 



CGGGTGTACA 
AGCACAGTAT 
GCCTTGAGGG 
TACCGAGCAT 
CTCAACGGGG 
AGGTGGAGTC 
TTAGTTGAGG 
CCATGCTCAG 
TAGGGCTGAA 
GAAGAGTCGG 
TTTTAATAAT 
AATTGTAAAA 



AATGTAAAAA 
TATCTCTTTG 
GGCGGGGAGG 
ACCCCTGGAA 
CTCCGTGTCC 
CTCTGCTGAC 
GAAGTCACCC 
TATCCCTTCC 
CACATTCTTT 
GGCACACCAA 
TTTTGCTGAA 
AAAAAAAAA 



TGAAGGAATT 
CATTTCCTTC 
GCTTTGTGGC 
GCCTGGCGGC 
CAGCTTTCCT 
CCTGTCTGGA 
CAAACCCCAG 
AGACAGGCCC 
GGTAACTGTC 
GGAAGGGCAA 
TTCCTTTACA 



ACTTTTTATA 
CTGCCTGCTC 
TCAAAGGTAT 
ACGGCCTCCC 
TTCCTTGGCT 
AGATTTGGCT 
CTCCCACTTT 
TCCCCTCTCT 
CCCCAAGCCT 
GCGGGCAGCC 
ACTAAATAAC 



TGTGAGCGAG 
CTCAGTATCC 
GAAGGAGTCC 
CACCACGCCT 
CTCTGGGGTT 
CTAGCTGAGG 
CAGGGGCACG 
AGCGCCCCCT 
CCCATCCCCC 
CCATTTTGGG 
ACAGATATTG 



CAAGCGAGCA 
CCCCCATGCT 
ACATGTTCCC 
GTGCAAGACA 
AGTTCAGGGG 
AAGGAGTCTT 
TCTCAGATGG 
CTGTGGCTCC 
TGAGGGCCAG 
GACGTGAACG 
TTATAAATAA 



14230 
14290 
14350 
14410 
14470 
14530 
14590 
14650 
14710 
14770 
14830 
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Leu Val Pro Leu Leu Ser Ala Leu 

10 15 
Pro Lys Thr Cys Ser Pro Lys Gin 
? 5 30 
Cys lie Ser Lys Gly Trp Arg Cys 
45 

Gly. Ser Asp Glu Ala Pro Glu He 
60 

Cys Pro Pro Asn Glu His Ser Cys 

W 80 
Met Ser Arg Leu Cys Asn Gly lie 

9 ° 95 
Glu Gly Ala His Cys Arg Glu Leu 
!°5 110 
Cys Gin His His Cys Val Pro Thr 
125 

Asn Ser Ser Phe Gin Leu Glu Ala 
140 

Asp Glu Cys Ser Val Tyr Gly Thr 
155 160 
Asp Gly Ser Phe Thr Cys Gly Cys 

170 175 
Asp Asn Arg Ser Cys Lys Ala Lys 
185 l9Q 
Val Leu Leu He Ala Asn Ser Gin 
205 

Gly Ala Gin Val Ser Thr He Thr 
220 

Ala Met Asp Phe Ser Tyr Ala Asn 
. 235 240 
Gly Asp Ser Ala Ala Gin Thr Gin 

250 255 
Leu Lys Gly Phe Val Asp Glu His 
265 270 
His His 'Val Glu Gin Met Ala lie 
285 

Phe Val Asp Asp He Asp Asp Arg 
300 

Asp Thr Cys Val Thr Leu Leu Asp 
315 320 
Xle Ala Leu Asp Pro Ala Met Gly 

330 335 
Gin He Pro Xys Val Glu Arg Cys 
345 350 
Lys Leu Val Asp Ser Lys He Val 
365 

Leu Val Ser Arg Leu Val Tyr Trp 

380 ' 
Glu Val Val Asp Tyr Glu- Gly Lys 
395 400 
He Leu He Glu His Leu Tyr Gly 

410 415 
Tyr Ala Thr Asn Ser Asp Asn Ala 
425 430 
He Arg Val Asn Arg Phe Asn Ser 
445 

Val Asp Lys Gly Gly Ala Leu His 
460 
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lie Tyr His Gin Arg Arg Gin Pro Arg Val Arg Ser His Ala Cys Glu 
465 470 ' 475 480 

Ash Asp Gin Tyr Gly Lys Pro Gly Gly Cys Ser Asp lie Cys Leu Leu 

485 . 490 495 

Ala Asn Ser His Lys Ala Arg Thr Cys Arg Cys Arg Ser Gly Phe Ser 

500 505 510 

Leu Gly Ser Asp Gly Lys Ser Cys Lys Lys Pro Glu His Glu Leu Phe 

515 520 525 

Leu Val Tyr Gly Lys Gly Arg Pro Gly lie lie Arg Gly Met Asp Met 

530 535 540 

Gly Ala Lys Val Pro Asp Glu His Met lie Pro lie Glu Asn Leu Met 
545 550 555 560 

Asn Pro Arg Ala Leu Asp Phe His Ala Glu Thr Gly Phe lie Tyr Phe 

565 570 575 

Ala Asp Thr Thr Ser Tyr Leu lie Gly Arg Gin Lys lie Asp Gly Thr 

580 585 59a 

Glu Arg Glu Thr lie Leu Lys Asp Gly lie His Asn Val Glu Gly Val 

595 600 605 

Ala Val Asp Trp Met Gly Asp Asn Leu Tyr Tip Thr Asp Asp Gly Pro 

610 615 620 

Lys Lys Thr lie Ser Val Ala Arg Leu Glu Lys Ala Ala Gin Thr Arg 
625 630 635 640 

Lys Thr Leu lie Glu Gly Lys Met Thr His Pro Arg Ala lie Val Val 

645 650 655 

Asp Pro Leu Asn Gly Trp Met Tyr Trp Thr Asp Trp Glu Glu Asp Pro 

660 ' 665 670 

Lys Asp Ser Arg Arg Gly Arg Leu Glu Arg Ala Trp Met Asp Gly Ser 

675 680 685 

His Arg Asp lie Phe Val Thr Ser Lys Thr Val Leu Trp Pro Asn Gly 

690 695 700 

Leu Ser Leu Asp lie Pro Ala Gly Arg Leu Tyr Trp Val Asp Ala Phe 
70S 710 715 720 

Tyr Asp Arg lie Glu Thr lie Leu Leu Asn Gly Thr Asp Arg Lys lie 

725 730 735 

Val Tyr Glu Gly Pro Glu Leu Asn His Ala Phe Gly Leu Cys His His 

740 745 750 

Gly Asn Tyr Leu Phe Trp Thr Glu Tyr Arg Ser Gly -Ser Val Tyr Arg 

755 760 765 

Leu Glu Arg Gly Val Ala Gly Ala Pro Pro Thr Val Thr Leu Leu Arg 

770 775 780 

Ser Glu Arg Pro Pro lie Phe Glu lie Arg Met Tyr Asp Ala His Glu 
785 790 795 800 

Gin Gin Val Gly Thr Asn Lys Cys Arg Val Asn Asn Gly Gly Cys Ser 

805 810 815 

Ser Leu Cys Leu Ala Thr Pro Gly Ser Arg Gin Cys Ala Cys Ala Glu 

820 825 830 

Asp Gin Val Leu Asp Thr Asp Gly Val Thr Cys Leu Ala Asn Pro Ser 

835 840 845- 

Tyr Val Pro Pro Pro Gin Cys Gin Pro Gly Gin Phe Ala Cys Ala Asn 

850 855 860 

Asn Arg Cys lie Gin Glu Arg Trp Lys Cys Asp Gly Asp Asn Asp Cys 
865 * 870 875 880 

Leu Asp Asn Ser Asp Glu Ala Pro Ala Leu Cys His Gin His Thr Cys 

885 890 . 895 

Pro Ser Asp Arg Phe Lys Cys GJu Asn Asn Arg Cys lie Pro Asn Arg 

900 905 910 

Trp Leu Cys Asp Gly Asp Asn Asp Cys Gly Asn Ser Glu Asp Glu Ser 
915 920 925 
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Asn Ala Thr Cys Ser Ala Arg Thr Cys Pro Pro Asn Gin Phe Ser Cys 

930 935 940 

Ala Ser Gly Arg Cys lie Pro lie Ser Trp Thr Cys Asp Leu Asp. Asp 
945 950 955 960 

Asp Cys Gly Asp Arg Ser Asp Glu Ser Ala Ser Cys Ala Tyr Pro Thr 

965 970 975 

Cys Phe Pro Leu Thr Gin Phe Thr Cys Asn Asn Gly Arg Cys lie Asn 

980 985 990 

lie Asn Trp Arg Cys Asp Asn Asp Ash Asp Cys Gly Asp Asn Ser Asp 

995 1000 1005 

Glu Ala Gly Cys Ser His Ser Cys Ser Ser Thr Gin Phe Lys Cys Asn 

1010 1015 1020 

Ser Gly Arg Cys lie Pro Glu His Trp Thr Cys Asp Gly Asp Asn Asp 
025 1030 1035 1040 

Cys Gly Asp Tyr Ser Asp Glu Thr His Ala Asn Cys Thr Asn Gin Ala 

1045 1050 1055 

Thr Arg Pro Pro Gly Gly Cys His Ser Asp Glu- Phe Gin Cys Pro Leu 

1060 1065 1070 

Asp Gly Leu Cys lie Pro Leu Arg Trp Arg Cys Asp Gly Asp Thr Asp 

1075 1080 1085 

Cys Met Asp Ser Ser Asp Glu Lys Ser Cys Glu Gly Val Thr His Val 

1090 1095 ~ 1100 

Cys Asp Pro Asn Val Lys Phe Gly Cys Lys Asp Ser Ala Arg Cys lie 
105 1110 1U5 1120 

Ser Lys .Ala Trp Val Cys Asp Gly Asp Ser Asp Cys Glu Asp Asn Ser 

1125 H30 1135 

Asp Glu Glu Asn Cys Glu Ala Leu Ala Cys Arg Pro Pro Ser His Pro 

1KQ 1145 "* 1150 

Cys Ala Asn Asn Thr Ser Val Cys Leu Pro Pro Asp Lys Leu Cys Asp 

1155 1160 1165 

Gly Lys Asp Asp Cys Gly Asp Gly Ser Asp Glu Gly Glu Leu Cys Asp 

1170 1175 H80 

Gin Cys Ser Leu Asn Asn Gly Gly Cys Ser His Asn Cys Ser Val Ala 
185 1190 H95 1200 

Pro Gly Glu Gly He Val Cys Ser Cys Pro Leu Gly Met Glu Leu Gly 

1205 1210 1215 

Ser Asp Asn His Thr Cys Gin He Gin Ser Tyr Cys Ala Lys His Leu 

1220 1225 1230 

Lys Cys Ser Gin Lys Cys Asp Gin Asn Lys Phe Ser Val Lys Cys Ser 

1235 1240 1245 

Cys Tyr Glu Gly Trp Val Leu Glu Pro Asp Gly Glu Thr Cys Arg. Ser 

1250 1255 1260 

Leu Asp Pro Phe Lys Leu Phe He He Phe Ser Asn Arg His Glu lie ' 
265 1270 1275 1280 

Arg Arg lie Asp Leu His Lys Gly Asp Tyr Ser Val Leu Val Pro Gly 

1285 1290 1295 

Leu Arg Asn Thr lie Ala Leu Asp Phe His Leu Ser Gin Ser Ala Leu 

1300 1305 . 1310 

Tyr Trp Thr Asp Ala Val Glu Asp Lys He Tyr Arg Gly Lys Leu Leu 

1315 1320 1325 

Asp Asn Gly Ala Leu Thr Ser Phe Glu Val Val He Gin Tyr Gly Leu 

1330 1335 1340 

Ala Thr Pro Glu Gly Leu Ala Val Asp Trp He Ala Gly Asn He Tyr 
345 1350 1355 1360 

Trp Val Glu Ser Asn Leu Asp Gin He Glu Val Ala Lys Leu Asp Gly 

1365 1370 1375 

Thr Leu Arg- Thr Thr Leu Leu Ala Gly Asp He Glu His Pro Arg Ala 

1380 1385 1390 

He Ala Leu Asp Pro Arg Asp Gly He Leu Phe Trp Thr Asp Trp Asp 
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I 395 1400 1405 

A1 %!?a LeU Pr ° tog Ile Glu Ala Ala Ser Met Se * <31y Ala Gly Arg 

1410 1415 1420 ■ 

Arg Thr Ile His Arg Glu Thr Gly Ser Gly Gly Cys Ala Asn Gly Leu 
ll S „ t i 43 ° 1435 1440 

Thr Val Asp Tyr Leu Glu Lys Arg Ile Leu Trp Ile Asp Ala Arg Ser 

1445 1450 1455 

Asp Ala Ile Tyr Ser Ala Arg Tyr Asp Gly Ser Gly His Met Glu Val 

1460 1465 1470 

Leu Arg Gly His Glu Phe Leu Ser His Pro Phe Ala Val Thr Leu Tyr 

1475 1480 1485 

Gly Gly Glu Val Tyr Trp Thr Asp Trp Arg Thr Asn Thr Leu Ala Lys 

1490 1495 1500 

Ala Asn Lys Trp Thr Gly His Asn Val Thr Val Val Gin Arg Thr Asn 
22* n _ 1510 1515 1520 

Thr Gin Pro Phe Asp Leu Gin Val Tyr Kis Pro Ser Arg Gin Pro Met 

1525 1530 1535 

Ala Pro Asn Pro Cys Glu Ala Asn Gly Gly Arg Gly Pro Cys Ser His 

1540 1545 " 1550 

Leu Cys Leu lie Asn Tyr Asn Arg Thr Val Ser Trp Ala Cys Pro His 

1555 1560 1565 

Leu Met Lys Leu His Lys Asp Asn Thr Thr Cys Tyr Glu Phe Lys Lys 

1570 1575 1580 

Phe Leu Leu Tyr Ala Arg Gin Met Glu Ile Arg Gly Val Asp Leu Asp 
5S5 1590 1595 - 1600 

Ala Pro Tyr Tyr Asn Tyr lie Ile Ser Phe Thr Val Pro Asp Ile Asp 

1605 1610 1615 

Asn Val Thr Val Leu Asp Tyr Asp Ala Arg Glu Gin Arg Val Tyr Trp 

1620 1625 1630 

Ser Asp Val Arg Thr Gin Ala lie lys Arg Ala Phe lie Asn Gly Thr 

1635 1640 1645 

Gly Val Glu Thr Val Val Ser Ala Asp Leu Pro Asn Ala Kis Gly Leu 

1650 1655 1660 

Ala Val Asp Trp Val Ser Arg Asn Leu Phe Trp Thr Ser Tyr Asp Thr 
665 1670 1675 " 1680 

Asn Lys Lys Gin Ile Asn Val Ala Arg Leu Asp Gly Ser Phe Lys Asn 

1685 1690 169l 

Ala Val Val Gin Gly Leu Glu Gin Pro His Gly Leu Val Val His Pro 

1700 1705 * 1710 

Leu Arg Gly Lys Leu Tyr Trp Thr Asp Gly Asp- Asn lie Ser Met Ala 

"15 1720 1725 

Asn Met Asp Gly Ser Asn His Thr Leu Leu Phe Ser Gly Gin Lys Gly 

1730 1735 1740 

Pro Val Gly Leu Ala Ile Asp Phe Pro Glu Ser Lys Leu Tyr Trp Ile 
74 5 1750 1755 1760 

Ser Ser Gly Asn His Thr lie Asn Arg Cys Asn Leu Asp Gly Ser Glu 

1765 1770 ' 1775 

Leu Glu Val lie Asp Thr Met Arg Ser Gin Leu Gly Lys Ala Thr Ala 

1780 1785 1790 

Leu Ala lie Met Gly Asp Lys Leu Trp Trp Ala Asp Gin Val Ser Glu 

1795 1800 1805 • 

Lys Met Gly Thr Cys Asn Lys Ala Asp Gly Ser Gly Ser Val Val Leu 

1810 1815 1820 

Arg Asn Ser Thr Thr Leu Val Met His Met Lys Val Tyr Asp Glu Ser 
82 5 1830 183$ 1840 

lie Gin Leu Glu His Glu Gly Thr Asn Pro Cys Ser Val Asn Asn Gly 

1845 1850 1855 

Asp Cys Ser Gin Leu Cys Leu Pro Thr Ser Glu Thr Thr Arg Ser Cys 
I860 1865 1870 
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Met Cys Thr Ala Gly Tyr Ser Leu Arg Ser Gly Gin Gin Ala Cys Glu 

1B75 1880 1885 

Gly Val Gly Ser Phe Leu Leu Tyr Ser Val His Glu Gly He Arg Gly 

1890 1895 1900 

He Pro Leu Asp Pro Asn Asp Lys Ser Asp Ala Leu Val Pro Val Ser 
905 1910 1915 1920 

Gly Thr Ser Leu Ala Val Gly He Asp Phe His Ala Glu Asn Asp Thr 



Asp Gin Thr Trp Arg Glu Asp Val Val Thr Asn Gly He Gly Arq Val 

1955 i960 1965 

Glu Gly He Ala Val Asp Trp He Ala Gly Asn He Tyr Trp Thr Asp 

1970 1975 1980 

Gin Gly Phe Asp Val He Glu Val Ala Arg Leu Asn Gly Ser Phe Arg 
985 1990 1995 2000 

Tyr Val Val He Ser Gin Gly Leu Asp Lys Pro Arg Ala He Thr Val 

2005 2010 2015 

His Pro Glu Lys Gly Tyr Leu Phe Trp Thr Glu Trp Gly His Tyr Pro 

2020 2025 " 2030 

Arg He Glu Arg Ser Arg Leu Asp Gly Thr Glu Arg Val Val Leu Val 

2035 2040 2045 

Asn Val Ser He Ser Trp Pro Asn Gly He Ser Val Asp Tyr Gin Gly 

2050 2055 ' 2060 

Gly Lys Leu Tyr Trp Cys Asp Ala Arg Met Asp Lys He Glu Arg He 
065 2070 2075 2080 

Asp Leu Glu Thr Gly Glu Asn Arg Glu Val Val Leu Ser Ser Asn Asn 

2085 2090 2095 

Met Asp Met Phe Ser Val Ser Val Phe Glu Asp Phe He Tyr Trp Ser 

2100 2105 2110 

Asp Arg Thr His Ala Asn Gly Ser lie Lys Arg Gly Cys Lys Asp Asn 

2115 2120 2125 

Ala Thr Asp Ser Val Pro Leu Arg Thr Gly He Gly Val Gin Leu Lys 

2130 2135 2140 

Asp He Lys Val Phe Asn Arg Asp Arg Gin Lys Gly Thr Asn Val Cys 
145 2150 2155 2160 

Ala Val Ala Asn Gly Gly Cys Gin Gin Leu Cys Leu Tyr Arg Gly Gly 

2165 2170 2175 

Gly Gin Arg Ala Cys Ala Cys Ala His Gly Met Leu Ala Glu Asp Gly 

2180 2185 2190 

Ala Ser Cys Arg Glu Tyr Ala Gly Tyr Leu Leu Tyr Ser Glu Arg Thr 

2195 2200 2205 

lie Leu Lys Ser lie His Leu Ser Asp Glu Arg Asn Leu Asn Ala Pro 

2210 2215 2220 

Val Gin Pro Phe Glu Asp Pro Glu His Met Lys Asn Val lie Ala Leu 
225 2230 2235 2240 

Ala Phe Asp Tyr Arg Ala Gly Thr Ser Pro Gly Thr Pro Asn Arg lie 

2245 2250 2255 

Phe Phe Ser Asp He His Phe Gly Asn lie Gin Gin He Asn Asp Asp 

2260 2265 2270 

Gly Ser Gly Arg Thr Thr He Val Glu Asn Val Gly Ser Val Glu Gly 

2275 -2280 2285 

Leu Ala Tyr His Arg Gly Trp Asp Thr Leu Tyr Trp Thr Ser Tyr Thr 

2290 2295 2300 

Thr Ser Thr He Thr Arg His Thr Val Asp Gin Thr Arg Pro Gly Ala 
305 2310 2315 2320 

Phe Glu Arg- Glu Thr Val He Thr Met Ser Gly Asp Asp His Pro Arg 

2325 2330 2335 

Ala Phe Val Leu Asp Glu Cys Gin Asn Leu Met Phe Trp Thr Asn Trp 



1925 

He Tyr Trp Val Asp Met Gly Leu 
1940 



1930 1935 * 

i, Ser Thr He Ser Arg Ala Lys Arg 
1945 1950 
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23 i° 2345 2350 

r Gly 

2,JbQ 

Val^eu Thr Leu lie Gladys Asp lie Arg Thr Pro Asn Gly Leu Ala 



Asn Glu Leu His Pro Ser lie Met Arg Ala Ala Leu Se/Sy" Ala Asn 
- ' 2360 2365 



lie Asp His Arg Ala Leu Tyr Phe Ser^sp Ala Thr Leu Asp 

Z39Q 2395 *>A(\k 

Lys lie Glu Arg Cys Glu Tyr Asp Gly Ser His Arg Tyr Val lie Leu 
2 . 4 . 0S ... 2410 4 2415 

Gly Leu Ala Val Tyr Gly Glu 
, _ ^425 2430 

lie Phe Trp Thr Asp Trp Val Arg Arg Ala Val Gin Arg Ala Asn Lys 
. 35 2ii< > 2445 

^Tsl ASP LeU L6U Val A *P fro Gin Gin 

2455 2460 
Pro Met Gly lie lie Ala Val Ala Asn Asp Thr Asn Ser Cys Glu Leu 
2470 2475 2480 

Ser Pro Cys Arg lie Asn Asn Gly Gly Cys Gin Asp Leu Cys Leu Leu 
2485 2490 
y His Val Asn 
0 

e Thr Cys Arg 

u Cys Ala Asn 
25 30 - 253S 2 540 



Lys Ser Glu Pro Val His Pro Phe Gl/Zeu Ala Val Tyr Gly"" His 
2420 2425 



Thr His Gin Gl/lils Val Asn Cys Ser'cys Arg Gly Gly Arg'ue Leu 

2500 2505 2510 

Gin Glu Asp Phe Thr Cys Arg Ala Val Asn Ser Ser Cys Arg Ala Gin 

2520 2525 
Asp Glu Phe Glu Cys Ala Asn Gly Glu Cys lie Ser Phe Ser Leu Thr 

Z5 30 ' 253S 2540 

Cys Asp Gly Val Ser His Cys Lys Asp Lys Ser Asp Glu Lys Pro Ser 
* „ ?550 2555 2560 

Tyr Cys Asn Ser Arg Arg Cys Lys Lys Thr Phe Arg Gin Cys Asn Asn 
256 * 2570 2575 



2605 



Gly Arg Cys Val Ser Asn Met Leu Trp Cys Asn Gly Val Asp Tyr Cys 

2580 2585 2590 

Giy Asp Gly Ser Asp Glu lie Pro Cys Asn Lys Thr Ala Cys Gly Val 
2595 2600 2605 

Gly ^n Ar9 ° yS Ar $ A *P G1 V Ser Cys lie Gly Asn Ser Ser Arg 

2615 2620 

Cys Asn Gin Phe Val Asp Cys Glu Asp Ala Ser Asp Glu Met Asn Cys 

625 2630 2635 2640 

Ser Ala Thr Asp Cys Ser Ser Tyr Phe Arg Leu Gly Val Lys Gly Val 

2645 2650 2655 

Leu Phe Gin Pro Cys Glu Arg Thr Ser Leu Cys Tyr Ala Pro Ser Trp 
2660 2665 2670 

p Glu 

CyS 2690 G ^ ^ Ar9 2695 ^ Pt ° L6U Phe Ala C * s 

Pro Ser Gly Arg Cys lie Pro Met Ser Trp Thr"™ Asp Lys Glu Asp 

705 2710 2715 2720 

Asp Cys Glu Asn Gly Glu Asp Glu Thr His Cys Asn Lys Phe Cys Ser 

2725 2730 2 735 

Glu Ala Gin Phe Glu Cys Gin Asn His Arg Cys lie Ser Lys Gin Trp 

2740 OT A c 1 _ ^ 



Val Cys Asp Gly Ala Asn Asp Cys Gly Asp Tyr Ser Asp Glu Arg Asp 
2o75 2680 w * 



2740 2745 2750 

p Glu 

— -— - «rou 27 65 

HiS o C £« G1 ° Gly Lys Thr Cys G1 y Pro Se * Ser Phe Ser Cys Pro Gly 
2770 . 2775 2780 



w *■ 2750 

Leu Cys Asp Gly Ser Asp Asp Cys Gly Asp Gly Ser Asp Glu Ala Ala 
2755 2760 27 65 



Thr His Val Cys Val Pro Glu Arg Trp Leu Cys Asp Gly Asp Lys Asp 
785 t 2790 2795 2800 

Cys Thr Asp Gly Ala Asp Glu Ser Val Thr Ala Gly Cys Leu Tyr Asn 
28 °5 2810 2815 
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Ser Thr Cys Asp Asp Arg Glu Phe Met Cys Gin Asn Arg Leu Cys lie 

2820 2825 2830 

Pro Lys His Phe Val Cys Asp His Asp Arg Asp Cys Ala Asp Gly Ser 

2835 2840 " 2845 

Asp Glu Ser Pro Glu Cys Glu Tyr Pro Thr Cys Gly Pro Asa Glu Phe 

2850 2855 2860 

Arg Cys Ala Asn Gly Arg Cys Leu Ser Ser Arg Gin Trp Glu Cys Asp 
865 2870 . * 2875 * 2880 

Gly Glu Asn Asp Cys His Asp His Ser Asp Glu Ala Pro Lys Asn Pro 

2885 2890 2895 

His Cys Thr Ser Pro Glu His Lys Cys Asn Ala Ser Ser Gin Phe Leu 

2900 2905 .2910 

Cys Ser Ser Gly Arg Cys Val Ala Glu Ala Leu Leu Cys Asn Gly Gin 

2915 2920 2925 

Asp Asp Cys Gly Asp Gly Ser Asp Glu Arg Gly Cys His Val Asn Glu 

2930 2935 2940 

Cys Leu Ser Arg Lys Leu Ser Gly Cys Ser Gin Asp Cys Glu Asp Leu 
945 2950 2955 2960 

Lys lie Gly Phe Lys Cys Arg Cys Arg Pro Gly Phe Arg Leu Lys Asp 

2965 2970 ' 2975 

Asp Gly Arg Thr Cys Ala Asp Leu Asp Glu Cys Ser Thr Thr Phe Pro 

2980 2985 2990 

Cys Ser Gin Leu Cys lie Asn Thr His Gly Ser Tyr Lys Cys Leu Cys 

2995 3000 " 3005 

Val Glu Gly Tyr Ala Pro Arg Gly Gly Asp Pro His Ser Cys Lys Ala 

3010 3015 3020 

Val Thr Asp Glu Glu Pro Phe Leu lie Phe Ala Asn Arg Tyr Tyr Leu 
025 3030 3035 3040 

Arg Lys Leu Asn Leu Asp Gly Ser Asn Tyr Thr Leu Leu Lys Gin Gly 

3045 3050 3055 

Leu Asn Asn Ala Val Ala Leu Ala Phe Asp Tyr Arg Glu Gin Met He 

3060 3065 3070 

Tyr Trp Thr Gly Val Thr Thr Gin Gly Ser Met He Arg Arg Met His 

3075 3080 3085 

Leu Asn Gly Ser Asn Val Gin Val Leu His Arg Thr Gly Leu Ser Asn 

3090 3095 3100 

Pro Asp Gly Leu Ala Val Asp Trp Val Gly Gly Asn Leu Tyr Trp Cys 
105 3110 3115 ^ 3120 

Asp Lys Gly Arg Asp Thr He Glu Val Ser Lys Leu Asn Gly Ala Tyr 

3125 3130 3135 

Arg Thr Val Leu Val Ser Ser Gly Leu Arg Glu Pro Arg Ala Leu Val 

3140 3145- 3150 

Val Asp Val Gin Asn Gly Tyr Leu Tyr Trp Thr Asp Trp Gly Asp His 

3155 3160 3165 

Ser Leu He Gly Arg He Gly Met Asp Gly Ser Gly Arg Ser He He 

3170 3175 .... 3180 

Val Asp Thr Lys lie Thr Trp Pro Asn Gly Leu Thr Val Asp tyr Val 
185 3190 3195 3200 

Thr Glu Arg He Tyr Trp Ala Asp Ala Arg Glu Asp Tyr He Glu Phe 

3205 3210 3215 

Ala Ser Leu Asp Gly Ser Asn Arg His Val Val Leu Ser Gin Asp He 

3220 3225 3230 

Pro His He Phe Ala Leu Thr Leu Phe Glu Asp Tyr Val Tyr Trp Thr 

3235 3240 3245 . 

Asp Trp Glu Thr Lys Ser He Asn Arg Ala His Lys Thr Thr Gly Ala 

3250 3255 3260 

Asn Lys Thr Leu Leu He Ser Thr Leu His Arg Pro Met Asp Leu His 
265 3270 3275 3280 

Val Phe His Ala Leu Arg Gin Pro Asp Val Pro Asn His Pro Cys Lys 
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3285 3290 3295 

Val Asn Asn Gly Gly Cys Ser Asn Leu Cys Leu Leu Ser Pro Gly Gly 
3300 3305 . 3310 J 2 

Gly His Lys Cys Ala Cys Pro Thr Asn Phe Tyr Leu Gly Gly Asp Gly 
3315 3320 3325 J 

Arg Thr Cys Val Ser Asn Cys Thr Ala Ser Gin Phe Val Cys Lys Asn 

3,3U 3335 3340 

Asp Lys Cys lie Pro Phe Trp Trp Lys Cys Asp Thr Glu Asp Asp Cys 
3350 3355 3360 

Gly Asp His Ser Asp Glu Pro Pro Asp Cys Pro Glu Phe Lys Cys Arg 

3365 . 3370 3375 

Pro Gly Gin Phe Gin Cys Ser Thr Gly He Cys Thr Asn Pro Ala Phe 

3380 3385 3390 

He Cys Asp Gly Asp Asn Asp Cys Gin Asp Asn Ser Asp Glu Ala Asn 
3395 3400 3405 

Cy %^E IU His Val C y s 0 Leu * ro Ser Gin Phe Lys Cys Thr Asn Thr 

J * - u 3415 3420 

Asn Arg Cys He Pro Gly He Phe Arg Cys Asn Gly Gin Asp Asn Cys 
5 3430 3435 3440 

Gly Asp Gly Glu Asp Glu Arg Asp Cys Pro Glu Val Thr Cys Ala Pro 

3445 „ 3450 3455 

Asn Gin Phe Gin Cys Ser He Thr Lys Arg Cys He Pro Arg Val Trp 

34 60 34 65 3470 

Val Cys Asp Arg Asp Asn His Cys Val Asp Gly Ser Asp Glu Pro Ala 
3475 3480 3485 

ASr \?£« Thr Gln M6t Thr Cys G ly Val As P Glu Phe Arg Cys Lys Asp 

3490 3495 3500 

Ser Gly Arg Cys He Pro Ala Arg Trp Lys Cys Asp Gly Glu Asp Asp 
1 3510 3S1 5 3520 

cys Gly Asp Gly Ser Asp Glu Pro Lys Glu Glu Cys Asp Glu Arg Thr 

3525 3530 3535 

Cys Glu Pro Tyr Gin Phe Arg Cys Lys Asn Asn Arg Cys Val Pro Gly 

3540 3545 3550 

Arg Trp Gin Cys Asp Tyr Asp Asn Asp Cys Gly Asp Asn Ser Asp Glu 
3555 3560 3565 

Cys Thr Pro Arg Pro °y s Ser Glu Ser Gl" Phe Phe Cys Ala 
3570 3575 3580 

Asn Gly Arg Cys He Ala Gly Arg Trp Lys Cys Asp Gly Asp His Asp 
= 85 3590 3595 3 g 00 

Cys Ala Asp Gly Ser Asp Glu Lys Asp Cys Thr Pro Arg Cys Asp Met 

3605 3610 3615 

Asp Gin Phe Gin Cys Lys Ser Gly His Cys He Pro Leu Arg Trp Pro 

3620 3625 36 30 

Cys Asp Ala Asp Ala Asp Cys Met Asp Gly Ser Asp Glu Glu Ala Cys 
3635 3640 364S 

Gly ,~5 Gly Val Ar9 Thr °y s Pro Leu A «P Glu Phe Gin Cys Asn Asn 

3650 3655 3660 

Thr Leu Cys Lys Pro Leu Ala Trp Lys Cys Asp Gly Glu Asp Asp Cys 
~ 6S „ 3670 3675 3680 

Gly Asp Asn Ser Asp Glu Asn Pro Glu Glu Cys Ala Arg Phe He Cys 
_ „ 3685 3690 3695 

Pro Pro Asn Arg Pro Phe Arg Cys Lys Asn Asp Arg Val Cys Leu Trp 

3700 3705 3710 

lie Gly Arg Gin Cys Asp Gly Val Asp Asn Cys Gly Asp Gly Thr Asp 

3715 3720 3725 

Glu Glu Asp Cys Glu Pro Pro Thr Ala Gin Asn Pro His Cys Lys Asp 

3730 3735 3740 

Lys Lys Glu Phe Leu Cys Arg Asn Gin Arg Cys Leu Ser Ser Ser Leu 
745 3750 3755 3760 
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Arg Cys Asn Met Phe Asp Asp Cys Gly Asp Gly Ser Asp Glu Giu Asp 
m . 3765 3770 3775 

Cys Ser He Asp Pro Lys Leu Thr Ser Cys Ala Thr Asn Ala Ser Met 

37 *0 3785 3790 

Cys Gly Asp Glu Ala Arg Cys Val Arg Thr Glu Lys Ala Ala Tyr Cys 

37 95 3800 ' 3805 

Ala Cys Arg Ser Gly Phe His Thr Val Prb Gly Gin Pro Gly Cys Gin 

3810 3815 . * 3820 

Asp He Asn Glu Cys Leu Arg Phe Gly Thr Cys Ser Gin Leu Trp Asn 
825 3830 3835 3840 

Lys Pro Lys Gly Gly His Leu Cys Ser Cys Ala Arg Asn Phe Met Lys 

3845 3S50 3855 

Thr His Asn Thr Cys Lys Ala Glu Gly Ser Glu Tyr Gin Val Leu Tyr 

3860 3865 3870 

He Ala Asp Asp Asn Glu He Arg Ser Leu Phe Pro Gly His Pro His 

3875 3880 3885 

Ser Ala Tyr Glu Gin Thr Phe Gin Gly Asp Glu Ser Val Arg He Asp 

3890 3895 3900 • 

Ala Met Asp Val His Val Lys Ala Gly Arg Val Tyr Trp Thr Asn Trp 
905 3910 3915 3920 

His Thr Gly Thr He Ser Tyr Arg Ser Leu Pro Pro Ala Ala Pro Pro 

3925 3930 3935 

Thr Thr Ser Asn Arg His Arg Arg Gin He Asp Arg Gly Val Thr His 

3940 3945 3950 

Leu Asn He Ser Gly Leu Lys Met Pro Arg Gly He Ala He Asp Trp 

3955 3960 3965 

Val Ala Gly Asn Val Tyr Trp Thr Asp Ser Gly Arg Asp Val lie Glu 

3970 3975 3980 

Val Ala Gin Met Lys Gly Glu Asn Arg Lys Thr Leu He Ser Gly Met 
985 3990 3995 4000 

He Asp Glu Pro His Ala He Val Val Asp Pro Leu Arg Gly Thr Met 

4005 4010 . 4015 

Tyr Trp Ser Asp Trp Gly Asn His Pro Lys He Glu Thr Ala Ala Met 

4020 4025 4030 

Asp Gly Thr Leu Arg Glu Thr Leu Val Gin Asp Asn He Gin Trp Pro 

4035 4040 4045 

thr Gly Leu Ala Val Asp Tyr His Asn Glu Arg Leu Tyr Trp Ala Asp 

4050 4055 4060 

Ala Lys Leu Ser Val He Gly Ser He Arg Leu Asn Gly Thr Asp Pro 
065 4070 4075 4080 

He Val Ala Ala Asp Ser Lys Arg Gly Leu Ser His Pro Phe Ser He 

4085 4090 4095 

Asp Val Phe Glu Asp Tyr He Tyr Gly Val Thr Tyr He Asn Asn Arg 

4100 4105 4110 

Val Phe Lys lie His Lys Phe Gly His Ser Pro Leu Tyr Asn Leu Thr 

4115 4120 4125 

Gly Gly Leu Ser His Ala Ser Asp Val Val Leu Tyr His Gin His Lys 

4130 4135 4140 

Gin Pro Glu Val Thr Asn Pro Cys Asp Arg Lys Lys Cys Glu Trp Leu 
145 4150 4155 " 4160 

Cys Leu Leu Ser Pro Ser Gly Pro Val Cys Thr Cys Pro Asn Gly Lys 

4165 4170 4175 

Arg Leu Asp Asn Gly Thr Cys Val Pro Val Pro Ser Pro Thr Pro Pro 

4180 4185 4190 

Pro Asp Ala Pro Arg Pro Gly Thr Cys Thr Leu Gin Cys Phe Asn Gly 

4195 4200 4205 

Gly Ser Cys Phe Leu Asn Ala Arg Arg Gin Pro Lys Cys Arc Cys Gin 

4210 4215 4220 

Pro Arg Tyr Thr Gly Asp Lys Cys Giu Leu Asp Gin Cys Trp Glu Tyr 



FIG. 12B 



WO 01/92474 PCT7US01/18041 

50/91 



225 4230 4235 

Cys His Asn Gly Gly Thr Cys Ala Ala Ser Pro Ser. Gly Met Pro Thr 

4250 • 4255 

Cys Arg Cys Pro Thr Gly Phe Thr Gly Pro Lys Cys Thr Ala Gin Val 

4260 4265 
Cys Ala Gly Tyr Cys Ser Asn Asn Ser Thr Cys Thr Val Asn Gin Gly 
4215 4280 4285 

ASn 4290 ^ Arg 4M S ^ Pr ° Gly ^ he LSU Gly Arg C * s 



4 290 4295 4300 

Gin Tyr Arg Glh Cys Ser Gly Phe Cys Glu Asn Phe Gly Thr Cys Gin 
305 43ia 4 315 * 3 4320 

Met Ala Ala Asp Gly Ser Arg Gin Cys Arg Cys Thr Val Tyr Phe Glu 
4325 4330 4335 

3 Cys Ser Arg Cys Leu Gin 
•--»~ 4345 4350 

Cys Val Val Asn Lys Gin Thr Gly Asp Val Thr Cys Asn Cys Thr Asp 
4355 43CQ 4355 



4335 

Gly Pro Arg Cys Glu Val Asn Lys Cys Ser Arg Cys Leu Gin Gly Ala 

4340 4345 4350 

Cys Val Val Asn Lys Gin Thr Gly Asp Val Thr Cys Asn Cys Thr Asp 
4355 4360 4365 

Gly ^2 VaI A1 * Pr ° Ser Cys Leu Thr C V S Ile Asp His Cys Ser Asn 

4370 4375 4380 

Gly Gly Ser Cys Thr Met Asn Ser Lys Met Met Pro Glu Cys Gin Cys 
385 43 90 4395 4400 

Pro Pro His Met Thr Gly Pro Arg Cys Gin Glu Gin Val Val Ser Gin 

4405 4410 4415 

Gin Gin Pro Gly His Met Ala Ser He Leu lie Pro Leu Leu Leu Leu 

44 20 4425 4430 

Leu Leu Leu Leu Leu Val Ala Gly Val Val Phe Trp Tyr Lys Arg Arg 

4435 4440 "4445 

Val Arg Gly Ala Lys Gly Phe Gin His Gin Arg Met Thr Asn Gly Ala 

4 4 50 4455 44 60 

Met Asn Val Glu He Gly Asn Pro Thr Tyr Lys Met Tyr Glu Gly Gly 
465 4470 4475 4480 



I, 4475 4480 

Glu Pro Asp Asp Val Gly Gly Leu Leu Asp Ala Asp Phe Ala Leu Asp 

4485 4490 44B5 

Pro Asp Lys Pro Thr Asn Phe Thr Asn Pro Val Tyr Ala Thr Leu Tyr 

4500 4505 4510 

Met Gly Gly His Gly Ser Arg His Ser Leu Ala Ser Thr Asp Glu Lys 
4515 4520 4525 



4505 4510 
1 Ser Leu Ala Ser Thr Asp 
n-Jij qzzo 4525 

Arg Glu Leu Leu Gly Arg Gly Pro Glu Asp Glu lie Gly Asp Pro Leu 
4530 4535 d^n 



4535 4540 

Ala 
545 
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55 



103 



151 



199 



247 



GCTACAATCC ATCTGGTCTC CTCCAGCTCC TTCTTTCTGC AAC ATG GGG AAG AAC 

Met Gly Lys Asn 
1 

AAA CTC CTT CAT CCA ACT CTG GTT CTT CTC CTC TTG GTC CtC CTG CCC 
Lys Leu Leu His Pro Ser Leu Val Leu Leu Leu Leu Val Leu Leu Pro 
5 10 . 15 20 

ACA * AC ^° TCA GT ? TCT - GGA A** CCG CA<3 TAT ATG GTT CTG GTC CCC 
Thr Asp Ala Ser Val Ser Gly Lys Pro Gin Tyr Met Val Leu Val Pro 
25 30 35 

TCC CTG CTC CAC ACT GAG ACC ACT GAG AAG GGC TGT GTC CTT CTG AGC 
Ser Leu Leu His Thr Glu Thr Thr Glu Lys Gly Cys Val Leu Leu Ser 
40 45 50 

TAC CTG AAT GAG ACA GTG ACT GTA AGT GCT TCC TTG GAG TCT GTC AGG 
Tyr Leu Asn Glu Thr Val Thr Val Ser Ala Ser Leu Glu Ser Val Arg 
5 5 60 65 

GGA AAC AGG AGC CTC TTC ACT GAC CTG GAG GCG GAG AAT GAC GTA CTC 295 
Gly Asn Arg Ser Leu Phe Thr Asp Leu Glu Ala Glu Asn Asp Val Leu 
70 75 80 

CAC TGT GTC GCC TTC GCT GTC CCA AAG TCT TCA TCC AAT GAG GAG GTA 343 
His Cys Val Ala Phe Ala Val Pro Lys Ser Ser Ser Asn Glu Glu Val 
85 90 .95 . 100 

ATG TTC CTC ACT GTC CAA GTG AAA GGA CCA ACC CAA GAA TTT AAG AAG 391 
Met Phe Leu Thr Val Gin Val Lys Gly Pro Thr Gin Glu Phe Lys Lys 
105 no us 

CGG ACC ACA GTG ATG GTT AAG AAC GAG GAC AGT CTG GTC TTT GTC CAG 439 
Arg Thr Thr Val Met Val Lys Asn Glu Asp Ser Leu Val Phe Val Gin 
120 125 130 

ACA GAC AAA TCA ATC TAC AAA CCA GGG CAG ACA GTG AAA TTT CGT GTT 487 
Thr Asp Lys Ser lie Tyr Lys Pro Gly Gin Thr Val Lys Phe Arg Val 
135 140 145 

GTC TCC ATG GAT GAA AAC TTT CAC CCC CTG AAT GAG TTG ATT CCA CTA 535 
Val Ser Met Asp Glu Asn Phe His Pro Leu Asn Glu Leu lie Pro Leu 
150 155 160 

GTA TAC ATT CAG GAT CCC AAA GGA AAT CGC ATC GCA CAA TGG CAG AGT 583 
Val Tyr lie Gin Asp Pro Lys Gly Asn Arg lie Ala Gin Trp Gin Ser 
165 170 175 180 

TTC CAG TTA GAG GGT GGC CTC AAG CAA TTT TCT TTT CCC CTC TCA TCA 631 
Phe Gin Leu Glu Gly Gly Leu Lys Gin Phe Ser Phe Pro Leu Ser Ser 
185 190 us 

GAG CCC TTC CAG GGC TCC TAG AAG GTG GTG GTA CAG AAG AAA TCA GGT 679 
Glu Pro Phe Gin Gly Ser Tyr Lys Val Val Val Gin Lys Lys Ser Gly 
200 205 210 

GGA AGG ACA GAG CAC CCT TTC ACC GTG GAG GAA TTT GTT CTT CCC AAG 727 
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Gly Arg Thr Glu His Pro Phe Thr Val Glu Glu Phe Val Leu Pro Lys 

220 225 

TTT GAA GTA CAA GTA AC A GTG CCA AAG ATA ATC ACC ATr frr r*R r** 
Phe Glu Val Gin Val Thr Val Pro Lys lie lit J£ 2 SJ gJu 
4 235 240 

m« 22 SI* l GT GGC dTA ' TAC ACA TAT GGG AAG CCT GTC 

Glu Met Asn Val Ser Val Cys Gly Leu Tyr Thr Tyr Gly Lys Pro Val 

CCT GGA CAT GTG ACT GTGAGC ATT TGC AGA AAG TAT AGT GAC GCT TCP 
Pro Gly His Val Thr Val Ser He Cys ArJ £Js lyl Ser Asp S Ser 
265 270 2 7S 



- 775 



823 



871 



919 



So r?3 ??* ? ^ ^ G GCT TTC TGT GAG »** TTC AGT GGA 

Asp Cys Kxs Gly Glu Asp Ser Gin Ala Phe Cys Glu Lys Phe Ser Gly 

280 285 290 ■ 

rtS fI A i* C f° u AT TGC TTC TAT ^ CAA GTA AAA ACC AAG GTC 967 

Gin Leu Asn Ser His Gly Cys Phe Tyr Gin Gin Val Lys Thr Lys Val ' 
295 300 305 

TTC CAG CTG AAG AGG AAG GAG TAT GAA ATG AAA CTT CAC ACT GAG GCC 1015 
Phe Gin Leu Lys Arg Lys Glu Tyr Glu Met Lys Leu His Thr Glu Ala 
310 315 320 

CAG ATC CAA GAA GAA GGA ACA GTG GTG GAA TTG ACT GGA AGG CAG TCC 1063 
Gin lie Gin Glu Glu Gly Thr Val Val Glu Leu Thr Gly Arg Gin Ser 
325 330 . 335 340 

AGT GAA ATC ACA AGA ACC ATA ACC AAA CTC TCA TTT GTG AAA GTG GAC 1111 
Ser Glu He Thr Arg Thr He Thr Lys Leu Ser Phe Val Lys Val Asp 
345 350 355 

TCA CAC TTT CGA CAG GGA ATT CCC TTC TTT GGG CAG GTG CGC CTA GTA 1159 
Ser His Phe Arg Gin Gly lie Pro Phe Phe Gly Gin Val Arg Leu Val 
360 365 370 

GAT GGG AAA GGC GTC CCT ATA CCA AAT AAA GTC ATA TTC ATC AGA GGA 1207 
Asp Gly Lys Gly Val Pro He Pro Asn Lys Val He Phe He Arg Gly 
375 380 385 

AAT GAA GCA AAC TAT TAC TCC AAT OCT ACC ACG GAT GAG CAT GGC CTT 1255 
Asn Glu Ala Asn Tyr Tyr Ser Asn Ala Thr Thr Asp Glu His Gly Leu 
390 395 400 

GTA CAG TTC TCT ATC AAC ACC ACC AAC GTT ATG GGT ACC TCT CTT ACT 1303 
Val Gin Phe Ser He Asn Thr Thr Asn Val Met Gly Thr Ser Leu Thr 
405 410 415 420 

GTT AGG GTC AAT TAC AAG GAT CGT AGT CCC TGT TAC GGC TAC CAG TGG 1351 
Val Arg Val Asn Tyr Lys Asp Arg Ser Pro Cys Tyr Gly Tyr Gin Trp 
425 430 . 435 

GTG TCA GAA GAA CAC GAA GAG GCA CAT CAC ACT GCT TAT CTT GTG TTC 1399 
Val Ser Glu Glu His Glu Glu Ala His His Thr Ala Tyr Leu Val Phe "' 
440 445 45Q 
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2222 2 2 2 2 2 S £ 2 E SI 2 2 

4 60 465 

s 2 2 si s 2 s 2 2 s si si 2 ? G r « »» 

470 4-ic AXa His T y r Ile Leu Asn Gly 

4 /5 480 J 

2 S 2 2 2 2 2 2 2 E 2 5 5 2 S 2 - 

495 500 



2 SI 2 2 2 2 2 S 2 £ SI 2 2 SI 2 2 1591 
2 2 2 K 2 2 SI E 2 2 2 2 2 2 2 2 - 

525 530 

2222222222222222 «" 

540 545 

2 2 2 2 2 2 E 2 2 2 2 2 2 2 2 2 "» 

555 560 

2 2 2 2 2 2 2 E 2 2 2 2 2 2 2 2 

570 S75 sac 



2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 
2222222222222222 

605 61Q 

2u° Ser S Ser S ST JJ r J* C f TG T CTA CCA GAA ™ CTC ACT 

615 31 Tyr As ." Leu Leu Pro G ^ I-ys Asp Leu Thr 

620 g 25 

GGC TTC CCT GGG CCT.TTG AAT GACCAG GAC GAT GAA GAC TGC ATC AAT 
Gly Phe Pro Gly Pro Leu Asn Asp Gin Asp Asp Glu £p Jys ill £n 

£9 Ss A^n S? JX S J** r? A *? C ACA ™ ACT CCA G ™ TCA AGT 
645 Tyr H„ 71311 Gly Ile Thr T V r Thr Pro Val Ser Ser 

650 655 660 

2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 * on 

665 670 * 675 

S S £n S £s ?S CC ° ^ ATG TGT CCA CAG CTT 2119 

axa me Thr Asn Ser Lys lie Arg Lys Pro Lys Met Cys Pro Gin Leu 

685 690 



1831 



1879 



1927 



1975 



2023 
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CAA CAG TAT GAA ATG CAT GGA CCT GAA GGT CTA CGT GTA GGT TTT TAT 2167 
Gin Gin Tyr Glu Met His Gly Pro Glu Gly Leu Arg Val Gly Phe Tyr 
€95. 700 ~ 705 

GAG TCA GAT GTA ATG GGA AGA GGC CAT GCA CGC CTG GTG CAT GTT GAA 2215 
Glu Ser Asp Val Met Gly Arg Gly His Ala Arg Leu Val His Val Glu " 
710 715 720 

GAG CCT CAC ACG GAG ACC GTA CGA AAG TAG TTC CCT GAG ACA TGG ATC 2263 
Glu Pro His Thr Glu Thr Val Arg Lys Tyr Phe Pro Glu Thr Tip lie 
725 730 735 740 

TGG GAT TTG GTG GTG GTA AAC TCA GCA GGG GTG GCT GAG GTA GGA GTA 2311 
Trp Asp Leu Val Val Val Asn Ser Ala Gly Val Ala Glu Val Gly Val 
745 750 755 

ACA GTC CCT GAC ACC ATC ACC GAG TGG AAG GCA GGG GCC TTC TGC CTG 2359 
Thr Val Pro Asp Thr lie Thr Glu Trp Lys Ala Gly Ala Phe Cys Leu 
760 765 770 

TCT GAA GAT GCT GGA CTT GGT ATC TCT TCC ACT GCC TCT CTC CGA GCC 2407 
Ser Glu Asp Ala Gly Leu Gly lie Ser Ser Thr Ala Ser Leu Arg Ala 
775 780 785 

TTC CAG CCC TTC TTT GTG GAG CTT ACA ATG CCT TAC TCT GTG ATT CGT 2455 
Phe Gin Pro Phe Phe Val Glu Leu Thr Met Pro Tyr Ser Val lie Arg 
790 795 800 

GGA GAG GCC TTC ACA CTC AAG GCC ACG GTC CTA AAC TAC CTT CCC AAA 2503 
Gly Glu Ala Phe Thr Leu Lys Ala Thr Val Leu Asn Tyr Leu Pro Lys 
805 810 815 820. 

TGC ATC CGG GTC AGT GTG CAG CTG GAA GCC TCT CCC GCC TTC CTT GCT 2551 
Cys lie Arg Val Ser Val Gin Leu Glu Ala Ser Pro Ala Phe Leu Ala 
825 830 835 

GTC CCA GTG GAG AAG GAA CAA GCG CCT CAC TGC ATC TGT GCA AAC GGG 2599 , 
Val Pro Val Glu Lys Glu Gin Ala Pro His Cys lie Cys Ala Asn Gly 
840 845 850 

i 

CGG CAA ACT GTG TCC TGG GCA GTA ACC CCA AAG TCA TTA GGA AAT GTG 2647 
Arg Glri Thr Val Ser Trp Ala Val Thr Pro Lys Ser Leu Gly Asn Val 
855 860 " 865 

AAT TTC ACT GTG AGC GCA GAG GCA CTA GAG TCT CAA GAG CTG TGT GGG 2695 
Asn Phe Thr Val Ser Ala Glu Ala Leu Glu Ser Gin Glu Leu Cys Gly 
870 875 880 - 

ACT GAG GTG CCT TCA GTT CCT GAA CAC GGA AGG AAA GAC ACA GTC ATC 2743 
Thr Glu Val Pro Ser Val Pro Glu His Gly Arg Lys Asp Thr Val lie 
885 890 895 900 

AAG CCT CTG TTG GTT GAA CCT GAA GGA CTA GAG AAG GAA ACA ACA TTC 2791 
Lys Pro Leu Leu Val Glu Pro Glu Gly Leu Glu Lys Glu Thr Thr Phe 
905 910 * 915 

AAC TCC CTA CTT TGT CCA TCA GGT GGT GAG GTT TCT GAA GAA TTA TCC 2839 
Asn Ser Leu Leu Cys Pro Ser Gly Gly Glu Val Ser Glu Glu Leu Ser 
920 925 930 
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CTG AAA CTG CCA CCA AAT GTG GTA GAA GAA TCT GCC CGA GCT TCT GTC 
Leu Lys Leu Pro Pro Asn Val Val Giu Glu Ser Ala Arg Ala Ser Val 
9 35 9 4 o 945 

TCA GTT TTG GGA GAC ATA TTA GGC TCT GCC ATG CAA AAC ACA CAA AAT 
Ser Val Leu Gly Asp lie Leu Gly Ser Ala Met Gin Asn Thr Gin Asn 
950 955 960 

CTT CTC CAG ATG CCC TAT GGC TGT GGA GAG CAG AAT ATG GTC CTC TTT 
Leu Leu Gin Met Pro Tyr Gly Cys Gly Glu Glh Asn Met Val Leu Phe 
965 970 975 980 

GCT CCT AAC ATC TAT GTA CTG GAT TAT CTA AAT GAA ACA CAG CAG CTT 
Ala Pro Asn He Tyr Val Leu Asp Tyr Leu Asn Glu Thr Gin Glh Leu 
985 990 995 

ACT CCA GAG GTC AAG TCC AAG GCC ATT GGC TAT CTC AAC ACT GGT TAC 
Thr Pro Glu Val Lys Ser Lys Ala He Gly Tyr Leu Asn Thr Gly Tyr 
1000 1005 1010 

CAG AGA CAG TTG AAC TAC AAA CAC TAT GAT GGC TCC TAC AGC ACC TTT 
Gin Arg Gin Leu Asn Tyr Lys His Tyr Asp Gly Ser Tyr Ser Thr Phe 
101S 1020 1025 

GGG GAG CGA TAT GGC AGG AAC CAG GGC AAC AGC TGG CTC ACA GCC TTT 
Gly Glu Arg Tyr Gly Arg Asn Gin Gly Asn Thr Trp Leu Thr Ala Phe 
1030 1035 1040 

GTT CTG AAG ACT TTT GCC CAA GCT CGA GCC TAC ATC TTC ATC GAT GAA 
Val Leu Lys Thr Phe Ala Gin Ala Arg Ala Tyr lie Phe He Asp Glu 
L045 1050 1055 1060 

GCA CAC ATT ACC CAA GCC CTC ATA TGG CTC TCC CAG AGG CAG AAG GAC 
Ala His lie Thr Gin Ala Leu He Trp Leu Ser Gin Arg Gin Lys Asp 
1065 1070 1075 

AAT GGC TGT TTC AGG AGC TCT GGG TCA CTG CTC AAC AAT GCC ATA AAG 
Asn Gly Cys Phe Arg Ser Ser Gly Ser Leu Leu Asn Asn Ala He Lys 
1080 1085 " 1090 

GGA GGA GTA GAA GAT GAA GTG ACC CTC TCC GCC TAT ATC ACC ATC GCC 
Gly Gly Val Glu Asp Glu Val Thr Leu Ser Ala Tyr He Thr He Ala 
1095 1100 1105 

CTT CTG GAG ATT CCT CTC ACA GTC ACT CAC CCT GTT GTC CGC AAT GCC 
Leu Leu Glu He Pro Leu Thr Val Thr His Pro Val Val Arg Asn Ala 
1110 1115 . 1120 

CTG TTT TGC CTG GAG TCA GCC TGG AAG ACA GCA CAA GAA GGG GAC CAT 
Leu Phe Cys Leu Glu Ser Ala Trp Lys Thr Ala Gin Glu Gly Asp His 
1125 H30 1135 1140 

GGC AGC CAT GTA TAT ACC AAA GCA CTG CTG GCC TAT GCT TTT GCC CTG 
Gly Ser His Val Tyr Thr Lys Ala Leu Leu Ala Tyr Ala Phe Ala Leu 
1145 H50 ' 1155 

GCA GGT AAC CAG GAC AAG AGG AAG GAA GTA CTC AAG TCA CTT AAT GAG 
Ala Gly Asn Gin Asp Lys Arg Lys Glu Val Leu Lys Ser Leu Asn Glu 
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SJ j£ vll £2 S2 ?f ^ C I" GTC TGG GAG CGC CCT CAG 3607 
Glu Ala Val Lys Lys Asp Asn S er Val His Trp Glu Arg Pro Gin Lys 

1175 H80 1185 

CCC AAG GCA CCA GTG GGG CAT TTT TAG GAA CCC CAG GCT CCC TCT GCT 3655 
Pro Lys Ala Pro Val Gly His Phe Tyr Glu Pro cX SI Pro Ser £a 

H95 1200 

tit SI? 2? m TG 5? o CC TAT GTG C * C CTC 6CT TAT CTC ACG GCC CAG 3703 
Glu Val Glu Met Thr Ser Tyr Val Leu Leu Ala Tyr Leu Thr Ala Gin 

1Z0& 121° . 1215 1220 

So 25 So T° C l CG ^ GA ° CTG ACC TCT GCA ACC ^ ATG GTG AAG 3751 
Pro Ala Pro Thr Ser Glu Asp Leu Thr Ser Ala Thr Asn He Val Lys 

I 225 1230 1235 

SS A ° G f* 6 ^f G ~ AG ** T GCC CAG GGC GGT T TC TCC TCC ACC CAG 3799 

Trp lie Thr Lys Gin Gin Asn Ala Gin Gly Gly Phe Ser Ser Thr Gin 
1240 1245 1250 

R« 5S v TG GCT CTC GCT CTG TCC TAT GGA GCC GCC ACA 3847 

Asp Thr Val Val Ala Leu His Ala Leu Ser Lys Tyr Gly Ala Ala Thr 
12 55 1260 1265 

TTT ACC AGG ACT GGG AAG GCT GCA CAG GTG ACT ATC CAG TCT TCA GGG 3895 

9 Thr Gly Lys Ala Ala Gln Val Thr He Gin Ser Ser Gly 
!270 1275 1280 

ACA TTT TCC AGC AAA TTC CAA GTG GAC AAC AAC AAT CGC CTG TTA CTG 3943 
,™ phe Ser Ser Gin Val Asp Asn Asn Asn Arg Leu Leu Leu 

1285 1290 1295 1300 

CAG CAG GTC TCA TTG CCA GAG CTG CCT GGG GAA TAC AGC ATG AAA GTG 3991 
Gin Gin Val Ser Leu Pro Glu Leu Pro Gly Glu Tyr Ser Met Lys Val 
1305 1310 1315 

ACA GGA GAA GGA TGT GTC TAC CTC CAG ACC TCC TTG AAA TAC AAT ATT 4039 . 
Thr Gly Glu Gly Cys Val Tyr Leu Gin Thr Ser Leu Lys Tyr Asn He 
1320 1325 1330 

* 

CTC CCA GAA AAG GAA GAG TTC CCC TTT GCT TTA GGA GTG CAG ACT CTG 4087 
Leu Pro Glu Lys Glu Glu Phe Pro Phe Ala Leu Gly Val Gin Thr Leu 
1335 1340 1345 

CCT CAA ACT TGT GAT GAA CCC AAA GCC CAC ACC AGC TTC CAA ATC TCC 4135 

r °,5Mj Thr Cys As P Glu Pro Ala His Th * Ser Phe Gin lie Ser 
1350 1355 1360 

CTA AGT GTC AGT TAC ACA GGG AGC CGC TCT GCC TCC AAC ATG GCG ATC 4183 
Leu Ser Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser Asn Met Ala He 
1365 1370 1375 1380 

GTT GAT GTG AAG ATG GTC TCT GGC TTC ATT CCC CTG AAG CCA ACA GTG 4231 
val Asp Val Lys Met Val Ser Gly Phe lie Pro Leu Lys Pro Thr Val 
1385 1390 ' 1395 

AAA ATG CTT. GAA AGA TCT AAC CAT GTG AGC CGG ACA GAA GTC AGC AGC 4279 
Lys Met Leu Glu Arg Ser Asn His Val Ser Arg Thr Glu Val Ser Ser 
1400 1405 1410 
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JSn vll S r? T l hC r TT f T MG GTG TCA CAG *CA CTG AGO 4327 
His Val Leu lie Tyr Leu Asp Lys Val Ser Asn Gin Thr Leu Ser 
1415 l4 20 1425 

r TG ll C 11° * GG 6TT CTG CAA GAT GTC CCA GTA AGA GAT CTC AAA CCA 4375 
Leu Phe Phe Thr Val Leu Gin Asp Val Pro Val Arg Asp Leu Lys Pro 

1430 1435 1440 

GCC ATA GTG AAA GTC TAT GAT TAC TAC GAG ACG GAT GAG TTT GCA ATC ' 4423 
Ala lie Val Lys Val Tyr Asp Tyr Tyr Glu Thr Asp Glu Phe Ala lie 
1445 1450 . 1455 1460 

GCT GAG TAC AAT GCT CCT TGC AGC AAA GAT CTT GGA AAT GCT TGAAGACCA 4474 
Ala Glu Tyr Asn Ala Pro Cys Ser Lys Asp Leu Gly Asn Ala 
1465 1470 ~ x 

CAAGGCTGAA AAGTGCTTTG CTGGAGTCCT GTTCTCTGAG CTCCACAGAA GArACrrrvv as.* a 
TTTGTATCTT TAAAGACTTG ATGAATAAAC ACTTTTTCTG ^ CA ° AGAA G ^ C ^^ «34 
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Ser Val Ser Gly Lys Pro Gin Tyr Met Val Leu Val Pro Ser Leu Leu 

15 10 15 

His Thr Glu Thr Thr Glu Lys Gly Cys Val Leu Leu Ser Tyr Leu Asn 

20 25 30 

Glu Thr Val Thr Val Ser Ala Ser Leu Glu Ser Val Arg Gly Asn Arg 

35 40 45 

Ser Leu Phe Thr Asp Leu Glu Ala Glu Asn Asp Val Leu His Cys Val 

50 55 -60 

Ala Phe Ala Val Pro Lys Ser Ser Ser Asn Glu Glu Val Met Phe Leu 
65 70 75 80 

Thr Val Gin Val Lys Gly Pro Thr Gin Glu Phe Lys Lys Arg Thr Thr 

85 90 95 

Val Met Val Lys Asn Glu Asp Ser Leu Val Phe Val Gin Thr Asp Lys 

100 105 110 

Ser He Tyr Lys Pro Gly Gin Thr Val Lys Phe Arg Val Val Ser Met 

115 120 125 

Asp Glu Asn Phe His Pro Leu Asn Glu Leu He Pro Leu Val Tyr He 

130 135 140 

Gin Asp Pro Lys Gly Asn Arg He Ala Gin Trp Gin Ser Phe Gin Leu 
145 150 155 160 

Glu Gly Gly Leu Lys Gin Phe Ser Phe Pro Leu Ser Ser Glu Pro Phe 

165 170 175 

Gin Gly Ser Tyr Lys Val Val Val Gin Lys Lys Ser Gly Gly Arg Thr 

180 185 190 

Glu His Pro Phe Thr Val Glu Glu Phe Val Leu Pro Lys Phe Glu Val 

195 200 205 

Gin Val Thr Val Pro Lys He He Thr He Leu Glu Glu Glu Met Asn 

210 215 220 

Val Ser Val Cys Gly Leu Tyr Thr Tyr Gly Lys Pro Val Pro Gly His 
225 230 235 240 

Val Thr Val Ser He Cys Arg Lys Tyr Ser Asp Ala Ser Asp Cys His 

245 250 255 

Gly Glu Asp Ser Gin Ala Phe Cys Glu Lys Phe Ser Gly Gin Leu Asn 

260 265 " 270 

Ser His Gly Cys Phe Tyr Gin Gin Val Lys Thr Lys Val Phe Gin Leu 

275 280 285 

Lys Arg Lys Glu Tyr Glu Met Lys Leu His Thr Glu Ala Glh He Gin 

290 295 300 

Glu Glu Gly Thr Val Val Glu Leu Thr Gly Arg Gin Ser Ser Glu He 
305 310 315 320 

Thr Arg Thr He Thr Lys Leu Ser Phe Val Lys Val Asp Ser His Phe 

325 330 335 

Arg Gin Gly He Pro Phe Phe Gly Gin Val Arg Leu Val Asp Gly Lys 

340 345 350 

Gly Val Pro He Pro Asn Lys Val He Phe He Arg Gly Asn Glu Ala 

355 360 365 

Asn Tyr Tyr Ser Asn Ala Thr Thr Asp Glu His Gly Leu Val Gin Phe 

370 375 380 

Ser He Asn Thr Thr Asn Val Met Gly Thr Ser Leu Thr Val Arg Val 
385 390 395 400 

Asn Tyr Lys Asp Arg Ser Pro Cys Tyr Gly Tyr Gin Trp Val Ser Glu 

405 - ,410 415 

Glu His Glu Glu Ala His His Thr Ala Tyr Leu Val Phe Ser Pro Ser 

420 425 430 

Lys Ser Phe Val His Leu Glu Pro Met Ser His Glu Leu Pro Cys Gly 

435 440 445 

His Thr Gin thr Val Gin Ala His Tyr He Leu Asn Gly Gly Thr Leu 

450 455 460 

Leu Gly Leu Lys Lys Leu Ser Phe Tyr Tyr Leu He Met Ala Lys Gly 
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465 470 475 480 

Gly lie Val Arg Thr Gly Thr His Gly Leu Leu Val Lys Gin Glu Asp 

485 490 495 

Met Lys Gly His Phe Ser lie Ser lie Pro Val Lys Ser Asp He Ala 

^ „ , 500 505 510 

Pro Val Ala Arg Leu Leu lie Tyr Ala Val Leu Pro Thr Gly Asp Val 

515 520 525 

lie Gly Asp Ser Ala Lys Tyr Asp Val Glu Asn Cys Leu Ala Asn Lys 

530 535 540 

Val Asp Leu Ser Phe Ser Pro Ser Gin Ser Leu Pro Ala Ser His Ala 
?f 5 550 555 560 

His Leu Arg Val Thr Ala Ala Pro Gin Ser Val Cys Ala Leu Arg Ala 

565 570 575 

Vai Asp Gin Ser Val Leu Leu Met Lys Pro Asp Ala Glu Leu Ser Ala 

5*0 585 590 

Ser Ser Val Tyr Asn Leu Leu Pro Glu Lys Asp Leu Thr Gly Phe Pro 

595 6 oo 605 . 

Gly Pro Leu Asn Asp Gin Asp Asp Glu Asp Cys He Asn Arg His Asn 

"0 615 620 

Val Tyr He Asn Gly He Thr Tyr Thr Pro Val Ser Ser Thr Asn Glu 
625 630 635 640 

Lys Asp Met Tyr Ser Phe Leu Glu Asp Met Gly Leu Lys Ala Phe Thr 

645 650 " " 655 

Asn Ser Lys He Arg Lys Pro Lys Met Cys Pro Gin Leu Gin Gin Tyr 

660 665 670 

Glu Met His Gly Pro Glu Gly Leu Arg Val Gly Phe Tyr Glu Ser Asp 

675 680 685 

Val Met Gly Arg Gly His Ala Arg Leu V.al His Val Glu Glu Pro His 

690 695 700 

Thr Glu Thr Val Arg Lys Tyr Phe Pro Glu Thr Trp He Trp Asp Leu 
705 710 715 720 

Val Val Val Asn Ser Ala Gly Val Ala Glu Val Gly Val Thr Val Pro 

725 730 ~ 735 

Asp Thr He Thr Glu Trp Lys Ala Gly Ala Phe Cys Leu Ser Glu Asp 

740 745 750 

Ala Gly Leu Gly lie Ser Ser Thr Ala Ser Leu Arg Ala Phe Gin Pro 

755 760 765 

Phe Phe Val Glu Leu Thr Met Pro Tyr Ser Val He Arg Gly Glu Ala 

770 775 780 

Phe Thr Leu Lys Ala Thr Val Leu Asn Tyr Leu Pro Lys Cys lie Arg 
785 790 795 " $00 

Val Ser Val Gin Leu Glu Ala Ser Pro Ala Phe Leu Ala Val Pro Val 

805 810 815 

Glu Lys Glu Gin Ala Pro His Cys He Cys Ala Asn Gly Arg Gin Thr 

820 825 830 

Val Ser Trp Ala. Val Thr Pro Lys Ser Leu Gly Asn Val Asn Phe Thr 

835 840 845 

Val Ser Ala Glu Ala Leu Glu. Ser Gin Glu Leu Cys Gly Thr Glu Val 

850 855 860 

Pro Set Val Pro Glu His Gly Arg Lys Asp Thr Val He Lys Pro Leu 
865 870 875 ~ 880 

Leu Val Glu Pro Glu Gly Leu Glu Lys Glu Thr Thr Phe Asn Ser Leu 

885 890 895 

Leu Cys Pro Ser Gly Gly Glu Val Ser Glu Glu Leu Ser Leu Lys Leu 

900 905 910* 

Pro Pro Asn Val Val Glu Glu Ser Ala Arg Ala Ser Val Ser Val Leu 

915 920 925 

Gly Asp He Leu Gly Ser Ala Met Gin Asn Thr Gin Asn Leu Leu Gin 
930 . 935 940 
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Met Pro Tyr Gly Cys Gly Glu Gin Asn Met Val Leu Phe Ala Pro Asn 

950 955 
He Tyr Val Leu Asp Tyr Leu Asn Glu Thr Gin Gin Leu Thr Pro Glu 

965 970 975 

Val Lys Ser Lys Ala He Gly Tyr Leu Asn Thr Gly Tyr Gin Arg Gin 

980 985 390 

Leu Asn Tyr Lys His Tyr Asp Gly Ser Tyr Ser Thr Phe Gly Glu Arg 
, 995 1000 . 1005 • 

TyJ \^£ *** Asn Gln Gly Asn Thlr Tr P Leu Ala Phe Val Leu Lys 

1010 1015 10 20 

Thr Phe Ala Gin Ala Arg Ala Tyr lie Phe lie Asp Glu Ala His lie 
025 1030 1035 1040 

Thr Gin Ala Leu lie Trp Leu Ser Gin Arg Gin Lys Asp Asn Gly Cys 

1045 1050 1055 

Phe Arg Ser Ser Gly Ser Leu Leu Asn Asn Ala He Lys Gly Gly Val 
„, . lo 60 1065 1070 

Glu Asp Glu Val Thr Leu Ser Ala Tyr He Thr He Ala Leu Leu Glu 
1075 1080 1085 

tnon LCU Val * hr His Pro Val Val Arg Asn Ala Leu Phe Cys 
1090 1095 noo 

Leu Glu Ser Ala Trp Lys Thr Ala Gin Glu Gly Asp His Gly Ser His 
105 111° 1115 1120 

Val Tyr Thr Lys Ala Leu Leu Ala Tyr Ala Phe Ala Leu Ala Gly Asn 
„, 1125 1130 1135 

Gin Asp Lys Arg Lys Glu Val Leu Lys Ser Leu Asn Glu Glu Ala Val 

1140 1145 1150 

Lys Lys Asp Asn Ser Val His Trp Glu Arg Pro Gin Lys Pro Lys Ala 

1155 ii 6 o 
Pro Val Gly His Phe Tyr Glu Pro Gin Ala Pro Ser Ala Glu Val Glu 

1170 H75 H80 

Met Thr Ser Tyr Val Leu Leu Ala Tyr Leu Thr Ala Gin Pro Ala Pro 
«5 H90 H95 1200 

Thr Ser Glu Asp Leu Thr Ser Ala Thr Asn He Val Lys Trp lie Thr 

t 1205 1210 ' 1215 

Lys Gin Gin Asn Ala Gin Gly Gly Phe Ser Ser Thr Gin Asp Thr Val 

1220 1225 1230 

Val Ala Leu His Ala Leu Ser Lys Tyr Gly Ala Ala Thr Phe Thr Arg 

1235 1240 1245 

Thr Gly Lys Ala Ala Gin Val Thr He Gin Ser. Ser Gly Thr Phe Ser 

1250 1255 1260 

Ser Lys Phe Gin Val Asp Asn Asn Asn Arg Leu Leu Leu Gin Gin Val 
265 1270 1275 1280 

Ser Leu Pro Glu Leu Pro Gly Glu Tyr Ser Met Lys Val Thr Gly Glu 

1285 1290 1295 

Gly Cys Val Tyr Leu gin 2ilE Ser Leu Lys Tj£r &sn lie Lea Pro Glu 
1300 1305 1310 

1315 '1320 l^9*T* ~" ~ 

1330 1335 1340 * 

,5*3 1350 1355 *——*•*"* 1360 

1380 1385 * % 1390 

j£i&3le~&r"TO 

1395 1400 1405" 
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CAGCGGTGCG 
CTCCATCAAG 
CGCCTCCTCC 
TTCGCCCGGG 
TGGATTTCGG 
CTACCTCTTC 
GAGGAGGGGG 
AGCAGGACCA 



AGCTCCAGGC 
CCCCCTCCAA 
CAATTGTGCA 
GAGGGGGAAA 
GGCAGGGGGC 
ACCGACGCCC 
AAAGGAGGAA 
GAGGGGAAGG 



CCATGCACTG 
AGGCTCCCCT 
TTTTTGCAGC 
GAGCAGCGAG 
GCACCCCCGT 
CTGGTGCGCT 
AAGGGGGACC 
GGCTGCTGCT 



AGGAGGCGGA 
ACCCGGTCCA 
CGGAGGCGGC 
GAGTGAAGCG 
CAGCAGGCCC 
TTGCCGAAGG 
CCCCAACTGG 
TGCATCAGCC 



AACAAGGGGA 
CGCCCCCCAC 
TCCGAGATGG 
GGGGGGTGGG 
TCCCCAAGGG 
AAAGAATAAG 
GGGGGGfGAA 
CACACC ATG 
Met 
1 



GCCCCCAGAG 

CCCCCCTCCC 

GGGTGTGAGC 

GT.GAAGGGTT 

GCTCGGAACT 

AACAGAGAAG 

GGAGAGAAGT 

CTG ACC 

Leu Thr 



Pro Jro IS 2? ?* r TG ? TG CCC CTG CTC TCA GC * CTG 6TC GCG GCG 
Pro Pro Leu Leu Leu Leu Leu Pro Leu Leu Ser Ala Leu Val Ala Ala 

3 10 15 

All J2 Asp S PrI £2 J? 2* AGC CCC ^ GGG TGC AGA 

JUa ne Asp Ala Pro Lys Thr Cys Set Pro Lys Gin Phe Ala Cys Arg 

° 30 35 

Asp Xn" lit S J? rT° ?* GGC CGG TGC GAC GGT GAG AGG 
Asp Gin He Thr Cys lie Ser Lys Gly Trp Arg Cys Asp Gly Glu Arg 

40 « 50 

Aso S£ Pro 2n r? A I CT GAC GAG GCC CCT GAG ATT TGT ^A CAG AGT 
Asp Cys Pro Asp Gly Ser Asp Glu Ala Pro Glu lie Cys Pro Gin Ser 

55 60 65 • 

£Js Ala G?n Si rf ?* **° GAG CM TGC CTG GGT ACT GAG 

Lys Ala Gin Arg Cys Gin Pro Asn Glu His Asn Cys Leu Gly Thr Glu 

70 75 80 

Ivl Val E S2 I* CGC ? ^ T6C MT GGG GTC CAG GAC TGC ATG 
Leu Cys Val Pro Met Ser Arg Leu Cys Asn Gly Val Gin Asp Cys Met 
85 go 95 

GAC GGC TCA GAT GAG GGG CCC CAC TGC CGA GAG CTC CAA GGC AAC TGC 
Asp Gly Ser Asp Glu Gly Pro His Cys Arg Su Leu aTn Gly Asn Cys 
100 ll0 11S 

III If* fit rf 2* ^ CAC ^ TGT GTC CCC ACA CTC GAT GGG CCC 
Ser Arg Leu Gly Cys Gin His His Cys Val Pro Thr Leu Asp Gly Pro 

120 125 130 

Thr SS riS rf ^ c GC AGC ^ ^ CTT CAG GCA GAT GGC AAG ACC 
Thr Cys Tyr Cys Asn Ser Ser Phe Gin Leu Gin Ala Asp Gly Lys Thr 

* 35 140 145 

CvS fii SJI Ik T t AT ^ AG TGC TCA GTG TAC G6C ACC TGC AGC CAG CTA 
Cys Lys Asp Phe Asp Glu Cys Ser Val Tyr Gly Thr Cys Ser Gift Leu 

150 155 !60 

SS JX ^ Sf* GAC S? 0 TCC TtC ATA TGT GGC TGT GTT GAA GGA TAC 
Cys Thr Asn Thr Asp Gly Ser Phe lie Cys Gly Cys Val Glu Gly Tyr 
i6S 170 175 

SS fin rt 0 o° G GAT ^° CGC TCC TGC MG G CC AAG AAC GAG CCA GTA 
1BO " ° SP A != Ar9 Ser Cys ^ Ala A ^ Glu Pro Val 

180 185 190 195 



60 
120 
180 
' 240 
300 
. 360 
420 
475 



523 



571 



619 



667 



715 



763 



811 



859 



907 



955 



1003 



.1051 
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S 2j S S S3 2 III s S JS E X s r c ?T C ™ Gcc 10 » 

200 Ser 61n Asn lie Leu Ala 

205 210 

S lyr 2 £ Sg E Gin JS E !? ^ ** CCT ACG AGG ACG 
215 ?1 !?£ Thr lle Thr pr ° Thr Ser Thr 

. 220 225 

CGG CAG ACC ACA GCC ATG GAC Ttc irir *r,m 

Arg Gin Thr Thr Ala Met Aso 25 522 3 G ?° ^ GAG ACC GTA TGC 

230 p 55? Tyr Ma Asn Glu Thr Va * c ys 

235 240 

S 52 K "I S5 S E SI S" 2° f K Ms »* « 

245 P Ala Ala Gln Thr G ^n -Leu Lys Cys Ala 

250 255 

5 a s s a g 3 e s si s ss s as s 

265 270 275 

- 2 - 2 ffi S SK g E « £ S SZ 2 £ 

265 290 

~ - - 1 K - S £ £ 5 £ LI S E ?S SS 
Sn g JS S5 £ S S 51? !? ? TG CTA GAC CTG GAA CTC ™ 

9 *sn biy Asp Thr Cys Val Thr Leu Leu Asp Leu Glu Leu Tyr 

315 320 

-1-^ S S 2 LI S E S 5 JS SI! E E 

335 

s 2 5 s- s g a s - s - 5 - s « 



345 .350 - - - 3 y 5 

365 370 

lie S Leu Asp 2 5? f ° S * f 616 ™ C TGG GCA GAT GCC TAT 
lie Thr Leu Asp Leu Val Ser Arg Leu Val Tyr .Trp Ala Asp Ala Tyr 



385 



Su Asp Sr ne Glu SI? S*? f° ? T GGC GGC "C CAG ACC 
su Asp Tyr lie Glu Val Val Asp Tyr Glu Gly Lys Qly Arg Gin Thr 

395 400 

?S rTf Sf 6 G ?° A T° CTG ATT GAG »C CTG TAC GGC CTG ACT GTG TTT 
He lie Gin Gly lie Leu lie Glu His Leu Tyr cly Leu JE vTl III 



415 



25 J2I g* 2 ttl S Jh c f 0 TCG GAC MT GCC gcg cag cag 
42S y yr r Asn Ser Asp Asn Aia Asn Aia gi * <»n 

430 435 



1147 



1195 



1243 



1291 



1339 



1387 



1435 



1483 



1531 



1579 



1627 



1675 



1723 



1771 
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1819 



SSs 3£ Ser Val He ?* S*? ^° CGC TTT AGC ACC GAG ««■« 

Lys Thr Ser Val lie Arg Val Asn Arg Phe Asn Ser Thr Glu Tyr Gin 

4i0 4 45 ■ 450 

S Tnr Are 51? f C ff* CTC CAC ATC TAC CAC CAG "67 

val Val Thr Arg Val Asp Lys Gly Gly Ala Leu His lie Tyr His Gin ' 

455 46Q 4 6 5 

AGG CGT CAG CCC CGA GTG AGG AGC CAT GCC TGT GAA AAC GAC CAG TAT 191 S 
Arg Arg Gin Pro Arg Val Arg Ser His Ala Cys Glu Asn Asp Gin Tyl " 
470 475 480 

GGG AAG CCG G6T GGC TGC TCT GAC ATC TGC CTG CTG GCC AAC AGC CAC 19 S3 
Gly Lys Pro Gly Gly Cys Ser Asp lie Cys Leu Leu A?a Js*n Ser Ss 
485 490 495 

AAG GCG CGG ACC TGC CGC TGC CGT TCC GGC TTC AGC CTG GGC AGT GAC 2011 
Lys Ala Arg Thr Cys Arg Cys Arg Ser Gly Phe Ser Leu Gly Ser Asp 
500 505 510 J 51 | 

GGG AAG TCA TGC AAG AAG CCG GAG CAT GAG CTG TTC CTC GTG TAT GGC 2059 
Gly Lys Ser Cys Lys Lys Pro Glu His Glu Leu Phe Leu Val Tyr Gly 
520 525 530 

AAG GGC CGG CCA GGC ATC ATC CGG GGC ATG GAT ATG GGG GCC AAG GTC 2107 
Lys Gly Arg Pro Gly He He Arg Gly Met Asp Met Gly Ala Lys Val 
535 540 545 

CCG GAT GAG CAC ATG ATC CCC ATT GAA AAC CTC ATG AAC CCC CGA GCC 21S5 
Pro Asp Glu His Met lie Pro He Glu Asn Leu Met Asn Pro Arg Ala 
550 555 560 

CTG GAC TTC CAC GCT GAG ACC GGC TTC ATC TAC TTT GCC GAC ACC ACC 2203 
Leu Asp Phe His Ala Glu Thr Gly Phe He Tyr Phe Ala Asp Thr Thr 
565 570 575 

AGC TAC CTC ATT GGC CGC CAG AAG ATT GAT GGC ACT GAG CGG GAG ACC 2251 
Ser Tyr Leu lie Gly Arg Gin Lys He Asp Gly Thr Glu Arg Glu Thr 
580 585 590 ' 595 

ATC CTG AAG GAC GGC ATC CAC AAT GTG GAG GGT GTG GCC GTG GAC TGG 2299 
He Leu Lys Asp Gly He His Asn Val Glu Gly Val Ala Val Asp Trp 
600 60S 610 

ATG GGA GAC AAT CTG TAC TGG ACG GAC GAT GGG CCC AAA AAG ACA ATC 2347 
Met Gly Asp Asn Leu Tyr Trp Thr Asp Asp Gly Pro Lys Lys Thr He 
615 620 625 

AGC GTG GCC AGG CTG GAG AAA GCT GCT CAG ACC CGC AAG ACT TTA ATC 2395 
Ser Val Ala Arg Leu Glu Lys Ala Ala Gin Thr Arg Lys Thr Leu He 
630 635 640 

GAG GGC AAA ATG ACA CAC CCC AGG GCT ATT GTG GTG GAT CCA CTC AAT 2443 
Glu Gly Lys Met Thr His Pro Arg Ala He Val Val Asp Pro Leu Asn 
645 650 655 

GGG TGG ATG TAC TGG ACA GAC TGG GAG GAG GAC CCC AAG GAC AGT CGG 2491 
Gly Trp Met Tyr Trp Thr Asp Trp Glu Glu Asp Pro Lys Asp Ser Arg 
660 665 670 67S 
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685 6go 

s S? g s "° s s'2 s s r s° f R mc ^ rac 

695 Pr ° A?n G1 * Leu Ser Leu Asp 



700 705 



S S K 5 2 g = s £ S SS £ g g S - 

715 720 

GAG ACG ATA CtG CTC AAT GGC ACA Gar rvr n »r « m „ 

Glu Thr lie Leu Leu Asn r1« SiT » CGG AAG ATT GTG TAT GAA GGT 
^nr xxe Leu Leu Asn Gly Thr Asp Arg Lys lie Val Tyr Glu Gly 

I « = S S i - ^ S £ SS S £ K £ 

750 7 5 5 



765 77 0 



773 780 785 

Pre 5E III til if ? A ™ G TAT GAT GCC C ^ CAG CAG CAA GTT GGC 
Pro lie Phe Glu lie Arg Met Tyr Asp Ala Gin Gin Gin Gin Val Gly 



800 



880 



2683 



2731 



2779 



2827 



2875 



JS ^° CGG GTG GGC GGG TGC AGC AGC CTG TGC TTG 

Thr Asn Lys Cys Arg Val Asn Asn Gly Gly Cys Ser Ser let c£ Leu 

810 815 

Ala J£ SI S ! J 00 2° TGC GCC TGT GCt GAG GAC ^G GTG TTG 

fl?n y «f? n Cys Cvs Glu Asp Gin Val Leu 

825 830 835 

S S £ SS flf S S a 2Sg lf c 3019 

840 845 850 



2923 



2971 



- S S S S Sg !K S S SEEK £ S 3087 

855 86.0 865 

2 til Arc S its 2! f C S* GAC GAT TGC CTG GAC AG * 3115 
J?? TlEp - X *» C * s As P <£y Asp Asn Asp Cys Leu Asp Asn Ser 



Sp SS S SS S f TC 7* £?* CAG CAC ACC TGC CCC TCG GAC CGA 3163 

Asp Glu Ala Pro Ala Leu Cys His Gin His Thr Cys Pro Ser Asp Arg 

oyo 895 

lie iCs Cys' 2S US f C CGG £ GC ATC CCC CGC TGG CTC TGC GAC 3211 
lot LyS CyS Glu Asn g» Ar * Cv * He Pro Asn Arg Trp Leu Cys Asp 

905 910 sis 
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GGG GAG 
Gly Asp 



TCA GCC 
Ser Ala 



TGC ATC 
Cys lie 



ACT CAG 
Thr Gin 
980 

TGC GAC 
Cys Asp 



AGC CAC 
Ser His 



£ SS IV SJ £S £ sj r Sf ?» «* »<* 

920 Asp Giu Ser Asn Ala Cys 

925 93 0 

So fS Cvs Pro S C ° ?° ° AG TTC TCC TGT GCC AGT GGC CGC 
Arg Thr Cys Pro Pro Asn Gin Phe Ser Cys Ala Ser Gly Arg 

940 945 

So S Ser S £ SI ? At f TC GAT GAC W TGT GGG GAC 
950 P r ASP L6U As P ^ Gly Asp 

=»« 960 

SS = 5 S S SI g 5 s 5 S S 5 S g 2° 

9*75 

«2 S S* KS S ° ^ 7* MC AT * AAC « AGA 
ne ™ r CyS A 311 G1 V A** Cys He Asn lie Asn Trp Arg 

JE 2S ?2 J AC ! GT GGG ^C AAC AGT GAC GAA GCC GGC TGC 
Asp Asn Asp Cys Gly Asp Asn Ser Asp Glu Ala Gly Cys 
1000 1005 loio 

Ser III III 5f 5? 2* TTC ** G TGC AAC AGC GGG CGT TGC 
Ser Cys Ser Ser Thr Gin Phe Lys Cys Asn Ser Gly Arg Cys 
1015 1020 1025 

55 Pro Sis S ?hr 2? t hT S" GAC ** T GAC TGC CCA GAC TAC 

103? P y \o3S y ^ Gly ASP Tyr 

Sr So 35 S 3£ S 7* ACC ^ CAG GCC ACG AGG CCC CCT 

1045 CyS Thr Asn Gln A^ Thr Arg Pro Pro 

1050 1055 

S GlJ Cys SE S SI rf p£° T TGC ° 6G CTG GAT GGA CTA TGC 

1060 y iEE PhG 61n GyS Leu As ? G1 V Le * C V S 

AUM 1070 1075 

ATC CCC CTG CGG TGG CGC TGC GAT GGG GAC ACT GAC TGC ATG GAC TCC 
tie Pro Leu Arg Trp Arg Cys Asp Gly Asp Thr Asp £ys £? Asp Ser 
108Q 1085 1090 

Ser SS X? f* 6 o GC GAG GGA GTG ACC CA C G TC TGC GAT CCC AGT 

Ser Asp Glu Lys Ser Cys Glu Gly Val Thr His Val Cys Asp Pro Ser 

1095 1100 ixos 

S5 £Js IS S ° r G ° ^ G GAC TCA GCT CGG »* ATC AGC AAA GCG TGG 
Val Lys Phe Gly Cys Lys Asp Ser Ala Arg Cys lie Ser Lys Ala Trp 

v 1115 112 0 

52 CVS Asn Sv 22 ^ n A ° I™ GAG GAT TCG GAC GAG GAG 

val Cys Asp Gly Asp Asn Asp Cys Glu Asp Asn Ser Asp Glu Glu Asn 

1125 H30 U35 * * 

Its Glu Ser 2 S ?* CC ° TCG ^ CCT TGT GCC ™ ^ C 

1140 la i?H Pr ° Pr ° Ser His Pro Ala Ash 

1150 H55 



3259 



3307 



3355 



3403 



3451 



3499 



3547 



3595 



3643 



3691 



3739 



3787 



3835 



3883 



3931 
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Tte Xr* S? SS 2S d CC IP ° AC CTG TGT GAT GGC GAC GAC 3979 
Thr Ser Val Cys Leu Pro Pro Asp Lys Leu Cys Asp Gly Asn Asp Asp 

1160 1165 ino 

*f, T .^ GC GAC GGC TCA GAT GAG GGC GAG CTC TGC GAC CAG TGC TCT CTG 4027 
Cys Gly Asp Gly Ser Asp Glu Gly Glu Leu Cys Asp Gin Cys Ser Leu 
1175 H.80 H85 

AAT AAC GGT GGC TGC AGC CAC AAC TGC TCA GTG GCA CCT GGC GAA GGC 4075 
Asn Asn Gly Gly Cys Ser His Asn Cys Ser Val Ala Pro Gly Glu Gly 
1190 1195 1200 

rTL vl? £ GT = CC l b ° 2 CT CTG CGC ATG GAG CTG GGG CCG CAC AAC CAC 4123 
lie Val Cys Ser Cys Pro Leu Gly Met Glu Leu Gly Pro Asp Asn His 
1*05 1210 1215 

^ l G ° £ AG r? C o AG AGC TAC TGT GCC *** CAT CTC AAA TGC AGC CAA 4171 
Thr Cys Gin lie Gin Ser Tyr Cys Ala Lys His Leu Lys Cys Ser Gin 
1220 122S 1230 1235 

AAG TGC GAC CAG AAC AAG TTC AGC GTG AAG TGC TCC TGC TAC GAG GGC 4219 
Lys Cys Asp Gin Asn Lys Phe Ser Val Lys Cys Ser Cys Tyr Glu Glv 
1240 1245 1250 

TGG GTC CTG GAA CCT GAC GGC GAG AGC TGC CGC AGC CTG GAC CCC TTC 4267 
Trp Val Leu Glu Pro Asp Gly Glu Ser Cys Arg Ser Leu Asp-Pro Phe, 
1255 1260 1265 

AAG" CCG TTC ATC ATT TTC TCC AAC CGC CAT GAA ATC CGG CGC ATC GAT 4315 
Lys Pro Phe lie lie Phe Ser Asn Arg His Glu lie Arg Arg lie Asp 
1270 1275 1280 

CTT CAC AAA GGA GAC TAC AGC GTC CTG GTG CCC GGC CTG CGC AAC ACC 4363 
Leu His Lys Gly Asp Tyr Ser Val Leu Val Pro Gly Leu Arg Ash Thr 
1285 1290 1295 

ATC GCC CTG GAC TTC CAC CTC AGC CAG AGC GCC CTC TAC TGG ACC GAC 4411 
lie Ala Leu Asp Phe His Leu Ser Gin Ser Ala Leu Tyr Trp Thr Asp 
1300 1305 1310 1315 

GTG GTG GAG GAC AAG ATC TAC CGC GGG AAG CTG CTG GAC AAC GGA GCC 4459 
Val Val Glu Asp Lys lie Tyr Arg Gly Lys Leu Leu Asp Asn Gly Ala 
1320 1325 1330 

CTG ACT AGT TTC GAG GTG GTG ATT CAG TAT GGC CTG GCC ACA CCC GAG 4507 
Leu Thr Ser Phe Glu Val Val lie Gin Tyr Gly Leu Ala Thr Pro Glu 
1335 1340 1345 

GGC CTG GCT GTA GAC TGG ATT GCA GGC AAC ATC TAC TGG GTG GAG AGT 4555 
Gly Leu Ala Val Asp Trp lie Ala Gly Asn lie Tyr Trp Val Glu Ser 
. 1350 1355 1360 

AAC CTG GAT CAG ATC GAG GTG GCC AAG CTG GAT GGG ACC CTC CGG ACC 4603 
Asn Leu Asp Gin lie Glu Val Ala Lys Leu Asp Gly Thr Leu Arg Thr 
1365 1370 1375 

ACC CTG CTG GCC GGT. GAC ATT GAG CAC CCA AGG GCA ATC GCA CTG GAT 4 651 
Thr Leu Leu Ala Gly Asp He Glu His Pro Arg Ala lie Ala Leu Asp 
1380 1385 1390 139S 
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4 699 



4747 



4795 



4843 



So Sc til S S C P G TGG ACA GAC TGG GAT GgC AGC CTG CCC 

Pro Arg Asp Gly lie Leu Phe Trp Thr Asp Trp Asp Ala Ser Leu Pro 

1400 1405 i 410 

S lie Glu G ? C f C m TG c QT GGG GGT GGG CGC CGG ACC CAC 
Arg lie Glu Ala Ala Ser Met Ser Gly Ala Gly Arg Arg Thr Val His 

1415 1420 1425 

CGG GAG ACC GGC TCT'GGG GGC TGG CCC AAC GGG CTC ACC GTG GAC TAC 
Arg Glu Thr Gly Ser Gly Gly Trp Pro Asn Gly Leu t£ Sal Asp Tyr 
1430 1435 1440 

fI G fO* f" 5 n GC r? C CTT TGG ATT GAC GCC AGG TCA GAT GCC ATT TAC 
Leu Glu Lys Arg He Leu Trp He Asp Ala Arg Ser Asp Ala ill Tyr 

1450 1455 

l~ ^ C CGT l kC GAC G ?° TCT GGC 010 ATG GAG GTG CTT CGG GGA CAC 4891 
Ser Ala Arg Tyr Asp Gly Ser Gly His Met Glu Val Leu Arg Gly His 

1460 146 5 1470 1475 

GAG TTC CTG TCG CAC CCG TTT GCA GTG ACG CTG TAC GGG GGG GAG GTC 4939 
Glu Phe Leu Ser His Pro Phe Ala Val Thr Leu Tyr Gly Gly Glu Val 
1480 1485 1490 

l GG A E T GAG TGG CGA ACA MC ACA CTG CCT AAG GCC AAC AAG TGG 4987 
Tyr Trp Thr Asp Trp Arg Thr Asn Thr Leu Ala Lys Ala Asn Lys Trp 
i49s 1500 1505 

fSr ??5 «f ?7° ~ CC GTG GTA CAG AGG ACC ^ ACC CAG CCC TTT 5035 

Thr Gly His Asn Val Thr Val Val Gin Arg Thr Asn Thr Gin Pro Phe 

* 5 1° 1515 1520 

GAC CTG CAG GTG TAC CAC CCC TCC CGC CAG CCC ATG GCT CCC AAT CCC SOB 3 
Asp Leu Gin Val Tyr His Pro Ser Arg Gin Pro Met Ala Pro Asn Pro 
I 525 1530 1535 

TGT GAG GCC AAT GGG GGC CAG GGC CCC TGC TCC CAC CTG TGT CTC ATC 5131 
Cys Glu Ala Asn Gly Gly Gin Gly Pro Cys Ser His Leu Cys Leu He 
1540 1S4 5 1550 1555 

AAC TAC AAC CGG ACC GTG TCC TGC GCC TGC CCC CAC CTC ATG AAG CTC 5179 
Asn Tyr Asn Arg Thr Val Ser Cys Ala Cys Pro His Leu Met Lys Leu 
1560 1565 1570 

CAC AAG GAC AAC ACC ACC TGC TAT GAG TTT AAG AAG TTC CTG CTG TAC 5227 
His Lys Asp Asn Thr Thr Cys Tyr Glu Phe Lys Lys Phe Leu Leu Tyr 
15" 1580 1585 

GCA CGT CAG ATG GAG ATC CGA GGT GTG GAC CTG GAT GCT CCC TAC TAC 5275 
Ala Arg Gin Met Glu He Arg Gly Val Asp Leu Asp Ala Pro Tyr Tyr 
1590 1595 1600 

AAC TAC ATC ATC TCC TTC ACG GTG CCC GAC ATC GAC AAC GTC ACA GTG 5323 
Asn Tyr He He Ser Phe Thr Val Pro Asp He Asp Asn Val Thr Val 
I 605 1610 1615 

CTA GAC TAC GAT GCC CGC GAG CAG CGT GTG TAC TGG TCT GAC GTG CGG 5371 
Leu Asp Tyr Asp Ala Arg Glu Gin Arg Val Tyr Trp Ser Asp Val Arg 
1620 1625 1630 1635 
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£ Sn SS S K! t GG G ? C TTC MC ^ GGC ACA GGC GTG G AG AGA 
Thr Gin Ala He Lys Arg Ala Phe He Asn Gly Thr Gly Val Glu Thr 

1640 1645 16S0 

Sal SI? w 5?* f A ° P G CCA *** GCC CAG GGG C TG GCT GTG GAG TGG 
Val Val Ser Ala Asp Leu Pro Asn Ala His Gly Leu Ala Val Asp Trp 

16SS 1660 1665 . P p 



~Z7 "Z ZZ , * L iuvs AW TAT GAC ACC AAT AAG AAG CAG 
Val Ser Arg Asn Leu Phe Trp Thr Ser Tyr Asp Thr Asn Lys Lys Gin 
1670 16T5 

V,-; GT ? J 0 CGG CTG GAT GGC IW Ti'C AAG AAC GCA GTG GTG CAG 

i«< ^ g LeU ^«S Gly Ser Phe L * s Asn Ala v *l Val Gin 

1690 



5419 



5467 



5515 



SS E S S 25 I GG 2* I AT GAC ACC » CAG 

c Asn 

1680 

K S «I? !5 ft «? GA I 2 GC ? CG !« «« ^ G GCA GTG GTG CAG 5563 

s Asn 
1695 

2S fl» r AG » CC CAT GGC °™ GTC GTC CTG CGT GGG AAG 5611 

Gly Leu Glu Gin Pro His Gly Leu Val Val His Pro Leu Arg Gly Lys 

1700 "OS 1710 - 1715 

Su JSr JS £k° GAT 2?* GAC ATC AGC A * G G CC AAC ATG GAT GGC S659 
Leu Tyr Trp Thr Asp Gly Asp Asn lie Ser Met Ala Asn Met Asp Gly 

1720 1725 1730 

S £f CGC £k C CTG CTC TTC AGT GGC CAG AAG GGC CCC GTG GGC CTG 5707 
Ser Asn Arg Thr Leu Leu Phe Ser Gly Gin Lys Gly Pro Val Gly Leu 
17 3S 1740 1745 * 

S ill 2° ll C f T GAA AGC *** CTC TAC TGG ATC AGC TCC GGG AAC 575S 
Ala He Asp Phe Pro Glu Ser Lys Leu Tyr Trp He Ser Ser Gly Ash 
1750 17S5 1760 

ml S£ C r? C 5* CGC TGC ^ CTG GAT GGG AGT GGG CTG GAG GTC ATC 5803 
i£« Acg Cys Asn teu ASD Gly Ser Gly Leu Glu Val lie 

1165 1770 1775 

GAT GCC ATG CGG AGC CAG CTG GGC AAG GCC ACC GCC CTG GCC ATC ATG 5851 
Asp Ala Met Arg Ser Gin Leu Gly Lys Ala Thr Ala Leu Ala lie Met 
1780 "85 1790 1795 

GGG GAC AAG CTG TGG TGG GCT GAT CAG GTG TCG GAA AAG ATG GGC ACA 5899 
Gly Asp Lys Leu Trp Trp Ala Asp Gin Val Ser Glu Lys Met Gly Thr 
1800 1805 1010 

TGC AGC AAG GCT GAC GGC TCG GGC TCC GTG GTC CTT CGG AAC AGC ACC 5947 
Cys Ser Lys Ala Asp Gly Ser Gly Ser Val Val Leu Arg Asn Ser Thr 
1815 1820 1825 

Th°- fl? f/? u TG S* C ATG AAG GTC TAT GAC GAG AGC ATC CAG CTG GAC 5995 
Thr ^u Val Met His Met Lys Val Tyr Asp Glu Ser He Gin Leu Asp 
!830 1835 1840 

CAT AAG GGC ACC AAC CCC TGC AGT GTC AAC AAC GGT GAC TGC TCC CAG 6043 

,«?f Gly Thr Asn Pro Cys ?er Vai Gly Asp Cys Set Gin 

1845 1850 1855 

CTC TGC CTG. CCC ACG TCA GAG ACG ACC CGC TCC TGC ATG TGC ACA GCC 6091 
Leu Cys Leu Pro Thr Ser Glu Thr Thr Arg Ser Cys Met Cys Thr Ala 
I860 1865 1870 1875 
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GGC TAT AGC CTC CGG AGT GGC CAG CAG GCC TGC GAG GGC GTA GGT TCC 6139 
Gly Tyr Ser Leu Arg Ser Gly Gin Gin Ala Cys Glu Gly Val Gly Ser 
1880 1885' 1890 

TTT CTC CTG TAC TCT GTG CAT GAG GGA ATC AGG GGA ATT CCC CTG GAT 6187 
Phe Leu Leu Tyr Ser Val His Glu Gly He Arg Gly lie Pro Leu Asp 
1895 1900 1905 

CCC AAT GAG AAG TCA GAT GCC CTG GTC CCA GTG TCC GGG ACC TCG CTG 6235 
Pro Asn Asp Lys Ser Asp Ala Leu Val Pro Val Ser Gly Thr Ser Leu 
1910 1915 1920 

GCT GTC GGC ATC GAC TTC CAC GCT GAA AAT GAC ACC ATC TAC TGG GTG 6283 
Ala Val Gly He Asp Phe His Ala Glu Asn Asp Thr He Tyr Trp Val 
1925 1930 1935 

GAC ATG GGC CTG AGC ACG ATC AGC CGG GCC AAG CGG GAC CAG ACG TGG 6331 
Asp Met Gly Leu Ser Thr He Ser Arg Ala Lys Arg Asp Gin Thr Trp 
"40 1945 1950 1955 

CGT . GAA GAC GTG GTG ACC AAT GGC ATT GGC CGT GTG GAG GGC ATT GCA 6379 
Arg Glu Asp Val Val Thr Asn Gly He Gly Arg Val Glu Gly He Ala 
I960 1965 ~ 1970 

GTG GAC TGG ATC GCA GGC AAC ATC TAC TGG ACA GAC CAG GGC TTT GAT 6427 
Val Asp Trp He Ala Gly Asn He Tyr Trp Thr Asp Gin Gly Phe Asp 
1975 1980 1985 

GTC ATC GAG GTC GCC CGG CTC AAT GGC TCC TTC CGC TAC GTG GTG ATC 6475 
Val lie Glu Val Ala Arg Leu Asn Gly Ser Phe Arg Tyr Val Val lie 
1990 1995 2000 

TCC CAG GGT CTA GAC AAG CCC CGG GCC ATC ACC GTC CAC CCG GAG AAA 6523 
Set Gin Gly Leu Asp Lys Pro Arg Ala He Thr Val His Pro Glu Lys 
2005 2010 2015 

GGG TAC TTG TTC TGG ACT GAG TGG GGT CAG TAT CCG CGT ATT GAG CGG 6571 
Gly Tyr Leu Phe Trp Thr Glu Trp Gly Gin Tyr Pro Arg He Glu Arg 
2020 2025 2030 " 2035 

TCT CGG CTA GAT GGC ACG GAG CGT GTG GTG CTG GTC AAC GTC AGC ATC 6619 
Ser Arg Leu Asp Gly Thr Glu Arg Val Val Leu Val Asn Val Ser He - 
2040 2045 2050 

AGC TGG CCC AAC GGC ATC TCA GTG GAC TAC CAG GAT GGG AAG CTG TAC 6667 
Ser Trp Pro Asn Gly He Ser Val Asp Tyr Gin Asp Gly Lys Leu Tyr 
2055 2060 2065 

TGG TGC GAT GCA CGG ACA GAC AAG ATT GAA CGG ATC GAC CTG GAG ACA 6715 
Trp Cys Asp Ala Arg Thr Asp Lys He Glu Arg He Asp Leu Glu Thr 
2070 2075 2080 

GGT GAG AAC CGC GAG GTG GTT CTG TCC AGC AAC AAC ATG GAC ATG TTT 6763 
Gly Glu Asn Arg Glu Val Val Leu Ser Ser Asn Asn Met Asp Met Phe 
2085 2090 2095 

TCA GTG TCT GTG TTT GAG GAT TTC ATC TAC TGG AGT GAC AGG ACT CAT 6811 
Ser Val Ser Val Phe Glu Asp Phe He Tyr Trp Ser Asp Arg Thr His 
2100 2105 2110 2115 
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S !Sn 35 sS if ^ ^ 2?* AGC ^ GAC RAT GCC ACA GAC TCC 
Ala Asn Gly Ser lie Lys Arg Gly Ser Lys Asp Asn Ala Thr Asp Ser 

2120 2125 2130 

GTG CCC CTG CGA ACC GGC ATC GGC GTC CAG CTT AAA GAC ATC AAA GTC 
Val Pro Leu Arg Thr Gly lie Gly Val Gin Leu %S Asp lie Lys* 85 
2135 2140 .. 2145 

Sf ^ « GG ? C ^ GG ^ G 660 ACC ^ GTG TGG GCG GTG GCC AAT 

2^0 P Gln Thr Asn VaI C * s Ala Val Ala Asn 

ZADU 2155 2160 

25 Glv £5 Sfn T° TG o GC P G TAC CGG GGC CGT GGG CGG GCC 7003 

2?65 217? Gly 2 A 7f Gly Gln Ar9 Ala 



6859 



6907 



6955 



Ss S 2£ £ PS r? G S TG f TG GCT GAA GAC GGA GCA «* TGC CGC 7051 
Cys Ala Cys Ala His Gly Met Leu Ala Glu Asp Gly Ala Ser Cys Arg 

■ 0 2185 2190 2195 

SS SI S£ nS f C ° TG f TC TAC TCA GAG CGC ACC ATT CTC AAG ACT 7099 
Glu Tyr Ala Gly Tyr Leu Leu Tyr Ser Glu Arg Thr He Leu Lys Ser 

2200 2205 2210 

ill S?° °I G l° G a* 1 CGC CTC *** GCG CCC GTG CAG CCC TTC 7147 

lie His Leu Ser Asp Glu Arg Asn Leu Asn Ala Pro Val Gln Pro Phe 

2215 2220 2225 

GAG GAC CCT GAG CAC ATG AAG AAC GTC ATC GCC CTG GCC TTT GAC TAC 7195 

ASP ,^° G1U His * et Asn v *l II- Ala Leu Ala Phe Asp Tyr 
223 0 2235 2 240 

S2 ^ of ^° ?* CCG GGC ACC CCC CGC ATC TTC TTC AGC GAC 7243 

Arg , A J a G1 y Thr Ser p "> Thr Pro Asn Arg He Phe Phe Ser Asp 

2245 2250 2255 

E! S A c VJ ^ ATC CAA CAG ATC GAC GAT GGC TCC AGG AGG 7291 

lie His Phe Gly Asn lie Gln Gln He Asn Asp Asp Gly Ser Arg Arg 
2260 2265 2 270 2275 

T?f rV , GTG GAA ^ GTG GGC TCC GTG CAA GGC CTG GCC TAT CAC .7339 
lie Thr lie Val Glu Asn Val Gly Ser Val Glu Gly Leu Ala Tyr His 
2280 2285 2290 

CG J GGC ICG G AC ACT CTC TAT TGG ACA AGC TAC ACG ACA TCC ACC ATC 7387 
Arg Gly Trp Asp Thr Leu Tyr Trp Thr Ser Tyr Thr Thr Ser Thr He 
2295 2300 2305 

JS £5° tu A f/ G GAC CAG ACC CGC CCA GGG GCC TTC GAG CGT GAG 7435 

Thr Arg His Thr Val Asp Gln Thr Arg Pro Gly Ala Phe Glu Arg Glu 
2310 2315 2320 

S*? tT C * CT GGA G AT GAC CAC CCA CGG GCC TTC GTT TTG 7483 

o«= U Thr Met SeE Gly Asp Asp His Pro Arg Ala Phe Val Leu 
2325 2330 2335 

GAC GAG TGC CAG AAC CTC ATG TTC TGG ACC AAC TGG AAT GAG CAG CAT 7531 
Asp Glu Cys Gln Asn Leu Met Phe Trp Thr Asn Trp Asn Glu Gln His 
2340 2345 2 350 2355 
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CCC AGC ATC ATG CGG GCG GCG CTC TCG GGA GCG AAT GTC CTG ACC CTT 7579 
Pro Ser lie Met Arg Ala Ala Leu Ser Gly Ala Asn Val Leu Thr Leu 
2360 2365 2370 

ATC GAG AAG GAC ATC CGT ACC CCC AAT GGC CTG GCC ATC GAC CAC CGT 7627 
lie Glu Lys Asp He Arg Thr Pro Asn Gly Leu Ala He Asp His Arg 
2375 2380 2385 

GCC GAG AAG CTC TAC TTC TCT GAC GCC ACC CTG GAC AAG ATC GAG CGG 7675 
Ala Glu Lys Leu Tyr Phe Ser Asp Ala Thr Leu Asp Lys He Glu Arg 
2390 2395 2400 

TGC GAG TAT GAC GGC TCC CAC CGC TAT GTG ATC CTA AAG TCA GAG CCT 7723 
Cys Glu Tyr Asp Gly Ser His Arg Tyr Val He Leu Lys Ser Glu Pro 
2405 2410 2415 

GTC CAC CCC TTC GGG CTG GCC GTG TAT GGG GAG CAC ATT TTC TGG ACT 7771 
Val His Pro Phe Gly Leu Ala Val Tyr Gly Glu His lie Phe Trp Thr 
2420 2425 2430 2435 

GAC TGG GTG CGG CGG GCA GTG CAG CGG GCC AAC AAG CAC GTG GGC AGC 7819 
Asp Trp Val Arg Arg Ala Val Gin Arg Ala Asn Lys His Val Gly Ser 
2440 2445 2450 



AAC ATG AAG CTG CTG CGC GTG GAC ATC CCC CAG CAG CCC ATG GGC ATC 7867 
Asn Met Lys Leu Leu Arg Val Asp He Pro Gin Gin Pro Met Gly He 
2455 2460 2465 

ATC GCC GTG GCC AAC GAC ACC AAC AGC TGT GAA CTC TCT CCA TGC CGA 7915 
He Ala Val Ala Ash Asp Thr Asn Ser Cys Glu Leu Ser Pro Cys Arg 
2470 1 2475 2480 

ATC AAC AAC GGT GGC TGC CAG GAC CTG TGT CTG CTC ACT CAC CAG GGC 7963 
He Asn Asn Gly Gly Cys Gin Asp Leu Cys Leu Leu Thr His Gin Gly 
2485 2490 2495 

CAT GTC AAC TGC TCA TGC CGA GGG GGC CGA ATC CTC CAG GAT GAC CTC 8011 
His Val Asn Cys Ser Cys Arg Gly Gly Arg lie Leu Gin Asp Asp Leu 
2500 2505 2510 2515 

ACC TGC CGA GCG GTG AAT TCC TCT TGC CGA GCA CAA GAT GAG TTT GAG 8059 
Thr Cys Arg Ala Val Asn Ser Ser Cys Arg Ala Gin Asp Glu Phe Glu 
2520 2525 2530 

TGT GCC AAT GGC GAG TGC ATC AAC TTC AGC CTG ACC TGC GAC GGC GTC 8107 
Cys Ala Asn Gly Glu Cys He Asn Phe Ser Leu Thr Cys Asp Gly Val. 
2535 2540 2545 

CCC CAC TGC AAG GAC AAG TCC GAT GAG AAG CCA TCC TAC TGC AAC TCC 8155 
Pro His Cys Lys Asp Lys Ser Asp Glu Lys Pro Ser Tyr Cys Asn Ser 
2550 2555 2560 

CGC CGC TGC AAG AAG ACT TTC CGG CAG TGC AGC AAT GGG CGC TGT GTG 8203 
Arg Arg Cys Lys Lys Thr Phe Arg Gin Cys Ser Asn Gly Arg Cys Val 
2565 2570 2575 

TCC AAC ATG CTG TGG TGC AAC GGG GCC GAC GAC TGT GGG GAT GGC TCT 8251 
Ser Asn Met Leu Trp Cys Asn Gly Ala Asp Asp Cys Gly Asp Gly Ser 
2580 2585 2590 2595 
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GAC GAG ATC CCT TGC AAC AAG ACA GCC TGT GGT GTG GGC GAG TTC CGC 8299 
Asp Glu lie Pro Cys Asn Lys Thr Ala Cys Gly vS Gly Su Z Sg ' 
2600 2605 2610 

TGC CGG GAC GGG ACC TGC ATC GGG AAC TCC AGC CGC TGC AAC CAG TTT 8347 
Cys Arg Asp Gly Thr Cys lie Gly Asn Ser Ser Arg Cys Asn Gin III 
2615 2620 2625 

GTG GAT TGT GAG GAC GCC TCA GAT GAG ATG AAC TGC AGT GCC ACC GAC 8395 
Val Asp Cys Glu Asp Ala Ser Asp Glu Met Asn Cys Ser Ala Thr Asp 
2630 2635 . 2640 

TGC AGC AGC TAC TTC CGC CTG GGC GTG AAG GGC GTG CTC TTC CAG CCC 8443 

yS ,f f c Ser Tyr Phe Leu G1 * Val tys Gly Val Leu Phe Gin Pro 
2645 2650 2 655 

£?, C ° AG n GG m CC . CTC TGC TAC GCA CCC ^ TGG GTG TGT GAT GGC 8491 

p Gly 
2675 



Cys Glu Arg Thr Ser Leu Cys Tyr Ala Pro Ser Trp Val Cys Asp Gly 

2665 2670 

GCC AAT GAC TGT GGG GAC TAC AGT GAT GAG CGC GAC TGC CCA GGT GTG 8539 
Ala Asn Asp Cys Gly Asp Tyr Ser Asp Glu Arg Asp Cys Pro Gly Val 
2680 2685 2690 

AAA CGC CCC AGA TGC CCT CTG AAT TAC TTC GCC TGC CCT AGT GGG CGC 8587 
Lys Arg Pro Arg Cys Pro Leu Asn Tyr Phe Ala Cys Pro Ser Gly Arg 
2695 2700 2 705 

TGC ATC CCC ATG AGC TGG ACG TGT GAC AAA GAG GAT GAC TGT GAA CAT 8635 
Cys lie Pro Met Ser Trp Thr Cys Asp Lys Glu Asp Asp Cys Glu His 
2710 2715 2720 

GGC GAG GAC GAG ACC CAC TGC AAC AAG TTC TGC TCA GAG GCC CAG TTT 8683 
Gly Glu Asp Glu Thr His Cys Asn Lys Phe Cys Ser Glu Ala Gin Phe 
2725 2730 2735 

GAG TGC CAG AAC CAT CGC TGC ATC TCC AAG CAG TGG CTG TGT GAC GGC 8731 
Glu Cys Gin Asn His Arg Cys lie Ser Lys Gin Trp Leu Cys Asp Gly 
2740 2745 2750 2755 

AGC GAT GAC TGT GGG GAT GGC TCA GAC GAG GCT GCT CAC TGT GAA GGC 8779 
Ser Asp Asp Cys Gly Asp Gly Ser Asp Glu Ala Ala His Cys Glu Gly 
2760 2765 2770 

AAG ACG TGC GGC CCC TCC TCC TTC TCC TGC CCT GGC ACC CAC GTG TGC 8827 
Lys Thr Cys Gly Pro Ser Ser Phe Ser Cys Pro Gly Thr His Val Cvs 
2775 . 2780 2 785 

GTC CCC GAG CGC TGG CTC TGT GAC GGT GAC AAA GAC TGT GCT GAT GGT 8875 
Val Pro Glu Arg Trp Leu Cys Asp Gly Asp Lys Asp Cys Ala Asp Gly 
2790 2795 2800 

GCA GAC GAG AGC ATC GCA GCT GGT TGC TTG TAC AAC AGC ACT TGT GAC 8923 
Ala Asp Glu Ser He Ala Ala Gly Cys Leu Tyr Asn Ser Thr Cys Asp 
2805 2810 2815 

GAC CGT GAG TTC ATG TGC CAG AAC CGC CAG TGC ATC CCC AAG CAC TTC 8971 
Asp Arg Glu Phe Met Cys Gin Asn Arg Gin Cys He Pro Lys His Phe 
28 20 2825 2830 2835 
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1% °£ SK J* g 2J J""" J» GAT MC TCC CCC ,01, 
2840 • G1 y Ser As P Glu Ser Pro 

2845 2850 

si s: gs s s gs s s r « « «* *« «* •» *>« 

2855 2860 Arg -°y s Ala 



2865 



5 Sjg ^ sss a g S 5 5 - s - g 

w 2875 2880 

?ys 25 2S Gin Ser So G^ !? S CC ^ CCA CAC TGC W AGC 
2885 P P 2890 ly ' ^Jlg HiS CyS Thr S * r 

Pro 22 ££ £s Ss £n S I CG I* CAG " C CTG TGC AGC AGT 655 
2900 y ^ Ala Ser Set Gln Phe Leu <*■ Ser Ser Gly 

2910 2915 

S SI Va! S 2S A 2 S F° l G ° ** C ™ GAT ™ *» «* 
9 yS Val Ala Leu Leu C V S Asn Gly Gin Asp Asp Cys Gly 

2920 2925 2930 

5 £ ess g I? gs s s s a « £ « « 

2935 2940 2945 

AAG CTC AGT GGC TGC AGC CAG GAC TGT GAG GAC CTC AAG ATC GGC TTC 
Lys Leu sej Gly Cys Ser Gin Asp Cys Glu As P Leu %s ill Sy 55 

29S5 , 2960 

Lys Cys ArS Cvs S S S° 11° CTG **« GAT GAC GGC CGG ACG 
2965 y 9 Phe Ar? ieu As P As ? G ^ Arg Thr 

2970 2975 

Ss" S Aso 22 ? AC rf J 00 AGC ACC ACC TTC CGC TGC AGC CAG CGC 
Cys Ala Asp Val Asp Glu Cys Ser Thr Thr Phe Pro Cys Ser Gin Arg 

2985 2990 2995 

TGC ATC AAC ACC CAT GGC AGC TAT AAG TGT CTG TGT GTG GAG GGC TAT 
Cys lie Asn Thr His Gly Ser Tyr Lys Cys Leu Cys vlt III g gj 
3000 3005 3010 

a2 SS So Sv 25 GAC 2°° ^ C AGC TGC "» 001 GTG AC * «= GAG 9547 
Ala Pro Arg Gly Gly Asp Pro His Ser Cys. Lys Ala Val Thr Asp Glu 

3015 3020 3025 

25 22 22 rl Q . A T? I™ 6CC **C 060 TAG TAG CTG CGC AAG CTC AAC 9595 

u Arg 
3040 

S ^ SR K I AC *? G ? TA CTT CAG GGC CTG AAC AAC GCC * 9643 

i Gly 
3055 

52 £a 25 SI III ? ° ^° ? A GAG CAG ATG ATC TAC TGG ACA GAT ■ - 9691 
Val Ala Leu Asp Phe Asp Tyr Arg Glu Gin Met He Tyr Trp Thr Asp 

3060 3065 3070 * 3075 



Glu Pra Phi t 1m 7, . C CGG TAC TAC CTG CGC AAG CTC AAC 

Glu troche ^u lie Phe Ala^Asn Arg Tyr Tyr Leu Arg Lys Leu Asn 

Leu h£ tit ES K J AC AGG T A CTT mG CMS GGC CfG ^ AAC GCC 
3045 3^50 ^ Gln ^¥ LeU ^ n A5m Ala 
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El !£ S g£ S S° C S*? CGA AGG ATG CAC CTT « GGG AGC 
" hr Tht Gln ,!^ Sr Met IIe Arg Met His Leu Asn Gly Ser 
3080 3085 3090 

£ 5? g? g? 2 GGC CTC AGC «* CCC GGG CTG 

Asn Val Gin Val Leu His Arg Thr Gly Leu Ser Asn Pro Asp Gly Leu 

3095 3100 3105 

111 v T ? GAC l GG f/? 6GT GGC MC CTG TAC TGG TGC GAC AAA GGC CGG 
Ala Val Asp Trp Val Gly Gly Asn Leu Tyr Trp Cys Asp Lys Gly Arg 
3110 3115 3120 

Sn tTo of S T ? c CC *** CTC MT 66,3 GCC TAT CGG AGG G TG CTG 

^.JS Ser Lys Leu Asn Gly Ala Tyr Arg Thr Val Leu 

3125 3130 3135 ' 

SI? W S3 ^° r T ° CGT GAG CCC AGG GCT CTG GTG G * G GAT GTG CAG 
31?0 , A 5? G1U Pr ° *■« **« Val Val A«P v al Gin 



3145 



3150 



3155 



AAT GGG TAC CTG TAC TGG ACA GAC TGG GGT GAC CAT TCA CTG ATC GGC 
Asn Gly Tyr Leu Tyr Trp Thr Asp Trp Gly Asp His Ser Leu lie Gly 
3160 3165 3170 

CGC ATC GGC ATG GAT GGG TCC AGC CGC AGC GTC ATC GTG GAC ACC AAG 
Arg He Gly Met Asp Gly Ser Ser Arg Ser Val He Val Asp Thr Lys 
3175 3180 3185 

ATC ACA TGG CCC AAT GGC CTG ACG CTG GAC TAT GTC ACT GAG CGC ATC 
lie Thr Trp Pro Asn Gly Leu Thr Leu Asp Tyr Val Thr Glu Arg lie 
3190 3195 3200 

TAC TGG GCC GAC GCC CGC GAG GAC TAC ATT GAA TTT GCC AGC CTG GAT 

yE o^? Ala Asp Ala Ar 9 Glu As P He Glu Phe Ala Ser Leu Asp 
3205 3210 3215 

GGC TCC AAT CGC CAC GTT GTG CTG AGC CAG GAC ATC CCG CAC ATC TTT 
Ser Asn Ar< 3 His Val Va l Leu Ser Gin Asp He Pro His He Phe 
3220 3225 32 30 3235 

GCA CTG ACC CTG TTT GAG GAC TAC GTC TAC TGG ACC GAC TGG GAA ACA 
Ala Leu Thr Leu Phe Glu Asp Tyr Val Tyr Trp Thr Asp Trp Glu Thr 
3240 3245 3250 

AAG TCC ATT AAC CGA GCC CAC AAG ACC ACG GGC ACC AAC AAA ACG CTC 
Lys Ser He Asn Arg Ala His Lys Thr Thr Gly Thr Asn Lys Thr Leu 
3255 3260 3265 

CTC ATC AGC ACG CTG CAC CGG CCC ATG GAC CTG CAT GTC TTC CAT GCC 
Leu He Ser Thr Leu His Arg Pro Met Asp Leu His Val Phe His Ala 
3270 3275 . 3280 

CTG CGC CAG CCA GAC GTG CCC AAT CAC CCC TGC AAG GTC AAC AAT GGT 

, A o? Gln Pro Asp Val Pro Asn His Pro c y s L ys Val Asn Asn Gly 
3285 3290 3295 

GGC TGC AGC AAC CTG TGC CTG CTG TCC CCC GGG GGA GGG CAC AAA TGT 
JZiX Cys Ser Asn Leu c y s Leu Leu Ser Pro Gly Gly Gly His Lys Cys 
3300 3305 3310 3315 
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S Ivs Pro fhr £S PH C f C P G ^ C AG ° GAT GGG CGC * CC * GT GTG "459 
Ala Cys Pro Thr Asn Phe Tyr Leu Gly Ser Asp Gly Arg Thr Cys Val 

3320 3325 3330 

Ifr i£h S? c GC ^ G T " GTA TGC ^ A** <* C ^AG TGC ATC 10507 

Ser Asn Cys Thr Ala Ser Gin Phe Val Cys Lys Asn Asp Lys Cys lie 

33 « 3340 3345 

CCC T?C TGG TGG AAG TGT GAC ACC GAG GAC GAC TGC GGG GAC CAC TCA 10555 
Pro Phe Trp Trp Lys Cys Asp Thr Glu Asp Asp Cys Gly Asp His Ser 
3350 3355 3360 

GAC GAG CCC CCG GAC TGC CCT GAG TTC AAG TGC CGG CCC GGA CAG TTC 10603 
Asp Glu Pro Pro Asp Cys Pro Glu Phe Lys Cys Arg Pro Gly Gin Phe 

337Q 3375 

CAG TGC TCC ACA GGT ATC TGC ACA AAC CCT GCC TTC ATC TGC GAT GGC 10651 
Gin Cys Ser Thr Gly lie Cys Thr Asn Pro Ala Phe He Cys Asp Gly 
3380 33 *5 3390 3395 

GAC AAT GAC TGC CAG GAC AAC AGT GAC GAG GCC AAC TGT GAC ATC CAC 10699 
Asp Asn Asp Cys Gin Asp Asn Ser Asp Glu Ala Asn Cys Asp lie His • 
34 00 3405 3410 

GTC TGC TTG CCC AGT CAG TTC AAA TGC ACC AAC ACC AAC CGC TGT ATT 10747 
Val Cys Leu Pro Ser Gin Phe Lys Cys Thr Asn Thr Asn Arg Cys lie 
3415 3420 3425 

CCC GGC ATC TTC CGC TGC AAT GGG CAG GAC AAC TGC GGA GAT GGG GAG 10795 
Pro Gly lie Phe Arg Cys Asn Gly Gin Asp Asn Cys Gly Asp Gly Glu 
3430 3435 3440 

GAT GAG AGG GAC TGC CCC GAG GTG ACC TGC GCC CCC AAC CAG TTC CAG 10843 
Asp Glu Arg Asp Cys Pro Glu Val Thr Cys Ala Pro Asn Gin Phe Gin 
3 <45 3450 3455 

TGC TCC ATT ACC AAA CGG TGC ATC CCC CGG GTC TGG GTC TGC GAC CGG 10891 
Cys Ser He Thr Lys Arg Cys. He Pro Arg Val Trp Val Cys Asp Arg 
3460 34 65 3470 3475 

GAC AAT GAC TGT GTG GAT GGC AGT GAT GAG CCC GCC AAC TGC ACC CAG 10939 
Asp Asn Asp Cys Val Asp Gly Ser Asp Glu Pro Ala Asn Cys Thr Gin 
3480 3485 3490 

ATG ACC TGT GGT GTG GAC GAG TTC CGC TGC AAG GAT TCG GGC CGC TGC 10987 
Met Thr Cys Gly Val Asp Glu Phe Arg Cys Lys Asp Ser Gly Arg Cys 
3495 3500 3505 

ATC CCA GCG CGT TGG AAG TGT GAC GGA GAG GAT GAC TGT GGG GAT GGC 11035 
He Pro Ala Arg Trp Lys Cys Asp Gly Glu Asp Asp Cys Gly Asp Gly 
3510 3515 3520 

TCG GAT GAG CCC AAG GAA GAG- TGT GAT GAA CGC ACC TGT GAG CCA TAC 11083 
Ser Asp Glu Pro Lys Glu Glu Cys Asp Glu Arg Thr Cys Glu Pro Tyr 
3525 3530 3535 

CAG TTC CGC. TGC AAG AAC AAC CGC TGC GTG CCC GGC CGC TGG CAG TGC 11131* 
Gin Phe Arg Cys Lys Asn Asn Arg Cys Val Pro Gly Arg Trp Gin Cys 
354 0 3545 3550 3555 
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Asp 5? Sp £n So SI ?° ^ TC<: GM GAA GAG AGC TGC ACC 
P Yr A?P CyS Gly As P Asn 3 ||| As P Gl« G lu Ser Cys Thr 

CCT CGG CCC T6C TCC GAG AGT GAG TTC TCC TGT GCC AAC GCC rrr Trr 
Pro Arg Pro cys Ser Glu Ser Glu Phe Ser Cys £ £n Gly Arg Cys 
00 3 580 35Q5 

A I° 22 ??S * GG *** ™ C GM 6GA GAC GAC TGC GCG GAC GGC 

3 SS 9 P ^ Gly ASP His As P <* s Asp Gly 

■» 3 » u 3595 > 3600 

TCG GAC GAG AAA GAC TGC ACC CCC CGC TGT GAC ATG GAC CAG TTC CAG 
Ser Asp Glu Lys Asp Cys Thr Pro Arg Cys Asp Me? Sp §in S Sn 
■ awD 36 ^0 3 615 

ss s s s g? c sk A r s cc ctg cgg tgg 060 tgt gac «» 

cys Lys Ser Gly His Cys He Pro Leu Arg Trp Arg Cys Asp Ala Asp 
Jt>Z ° 362S 3630 3635 

GCC GAC TGC ATG GAC GGC AGC GAC GAG GAG GCC TGC GGC ACT GGC GTG 
Ala Asp Cys Met Asp Gly Ser Asp Glu Glu Ala Cys Gly Thr Gly Si 
3640 3645 3650 

GGG AGC l GC GCC CTG GAC GAG TTC CAG TGC AAC AAC ACC TTG TGC AAG 
Arg Thr Cys Pro Leu Asp Glu Phe Gin Cys Asn Asn Thr Leu Cys Lys 
3655 3660 366S 

CCG CTG GCC TGG AAG TGC GAT GGC GAG GAT GAC TGT GGG GAC AAC TCA 
Pro Leu Ala Trp Lys Cys Asp Gly Glu Asp Asp Cys Gly Asp Asn Ser 
3670 3675 3680 

GAT GAG AAC CCC GAG GAG TGT GCC CGG TTC GTG TGC CCT CCC AAC CGG 

*\«c A5n Pf ° Glu Glu Ala Ar * ph « Cys Pro Pro Asn Arg 
3585 3690 3695 

CCC TTC CGT TGC AAG AAT GAC CGC GTC TGT CTG TGG ATC GGG CGC CAA 
Pro Phe Arg Cys Lys Asn Asp Arg Val Cys Leu Trp lie Gly Arg Gin 
3700 3705 3710 " 3715 

TGC GAT GGC ACG GAC AAC TGT GGG GAT GGG ACT GAT GAA GAG GAC TGT 
Cys Asp Gly Thr Asp Asn Cys Gly Asp Gly Thr Asp Glu Glu Asp Cys 
37 20 3725 3730 

GAG CCC CCC ACA GCC CAC ACC ACC CAC TGC AAA GAC AAG AAG GAG TTT 
Glu Pro. Pro Thr Ala His Thr Thr His Cys Lys Asp Lys Lys Glu Phe 
3735 3740 ' 3745 

CTG TGC CGG AAC CAG CGC TGC CTC TCC TCC TCC CTG CGC TGC AAC ATG 
fceu Cys Arg Asn Gin Arg Cys Leu Ser Ser Ser Leu Arg Cys Asn Met 
3? S0 3755 3760 

TTC GAT GAC TGC GGG GAC GGC TCT GAC GAG GAG GAC TGC AGC ATC GAC 
Phe „f? As P c * s G1 V As P Cly Ser Asp Glu Glu Asp Cys Ser He Asp 
3765 3770 3775 

CCC AAG CTG ACC AGC TGC GCC ACC AAT GCC AGC ATC TGT GGG GAC GAG 
Pro Lys Leu Thr Ser Cys Ala Thr Asn Ala Ser lie Cys Gly Asp Glu 
3780 3 7 8S 3790 3795 
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S £g Cys 511 Arc" Jg S® J** GCG GCG TAC TGT GCC * GC CGC TCG l"M 
Ala Arg Cys Val^Arg Thr Glu Lys Ala Ala Tyr Cys Ala Cys Arg Ser 

3805 3810 

iSS Se ££ ■£ 2? Pro Sf ?* 2?* TGC CAA GAC ATC ^ GAG 
• y Phe HiS ,*V* Val Pro G1 y Gin *«» Gly Cys Gin Asp lie Asn Glu 

3815 3820 3 8 25 . 

lit 2S ££ SS ?K ^k C 2 GG TCC CAG 070 TGC ** G AAC ACC AAG GGC U995 
Cys leu Arg Phe Gly Thr Cys Ser Gin Leu Cys Asn Asn Thr Lys Gly 

3830 3835 3840 

SS SK fin ES of J GC GCT CGG ^ TTC AT G AAG ACQ CAC AAC ACC l2 043 

V ™£ ^ S6r CyS , A ^ Arg Asn pha Met Lys Th * His Asn Thr 

3845 38S0 3355 

cS fi? S* S** I CT S AG TAC CAG GTC GTG TAC ATC GCT GAT GAC 12091 

Cys Lys Ala Glu Gly Ser Glu Tyr Gin Val Leu Tyr He Ala Asp Asp 
3860 3865 3870 ^ 38? f 

SS tT C f GC A6 ° f TG T l C CCC GGG CAC CCC TC <5 GCT TAC GAG 12139 

Asn Glu lie Arg Ser Leu Phe Pro Gly His Pro His Ser Ala Tyr Glu 

3880 3885 ... 3890 

°.? A 11° GGT GAC GAG AGT G TC CGC ATT GAT GCT ATG GAT GTC 12187 

Gin Ala Phe Gin Gly Asp Glu Ser Val Arg He Asp Ala Met Asp Val 
3895 3900 3905 

CAT GTC AAG GCT GGC CGT GTC TAT TGG ACC AAC TGG CAC ACG GGC ACC 12235 
Val Lys Ala Gly Arg Val Tyr Trp Thr Asn Trp His Thr Gly Thr 
3910 3915 3920 

ATC TCC TAC CGC AGC CTG CCA CCT GCT GCG CCT CCT ACC ACT TCC AAC 12283 

Xl# Tyc Arg Ser Leu Pro Pro Ala Ala Pro Pro Thr Thr Ser Asn 
3925 3930 3935 

CGC CAC CGG CGA CAG ATT GAC CGG GGT GTC ACC CAC CTC AAC ATT TCA 12331 
Reg His Arg Arg Gin lie Asp Arg Gly Val Thr His Leu Asn tie Ser 
3940 3945 3gso 3g55 

GGG CTG AAG ATG CCC AGA GGC ATC GCC ATC GAC TGG GTG GCC GGA AAC 12379 
Gly Leu Lys Met Pro Arg Gly He Ala lie Asp Trp Val Ala Gly Asn ' 
3960 3965 3970 

GTG TAC TGG ACC GAC TCG GGC CGA GAT GTG ATT GAG GTG GCG CAG ATG 12427 
Val Tyr Trp Thr Asp Ser Gly Arg Asp Val lie Glu Val Ala Gin Met 
3975 3980 3935 

SS ^ £v G 2*° CGC ^ ACG CTC ATC TCG GGC ATG ATT GAC GAG CCC 12475 
Lys Gly Glu Asn Arg Lys Thr Leu lie Ser Gly Met He Asp Glu Pro 
3990 3995 4000 

CAC GCC ATT GTG GTG GAC CCA CTG AGG GGG ACC ATG TAC TGG TCA GAC 12523 

, A ; a le al Val As P Pro Leu Ar 9 G1 y Thr Met Tyr Trp Ser Asp 
4005 4010 4015 

TGG GGC AAC. CAC CCC AAG ATT GAG ACG GCA GCG ATG GAT GGG ACG CTT 12571 

mZZR y AS " HlS Pro Lys Ile 61u Thr Ala Ala Met Asp Gly Thr Leu 
4020 4025 4030 4035 
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SS tu A GT ? ^ G GAC MC ATT CA(5 T<3G ACA GGC CTG GCC 12619 

Arg Glu Thr Leu Val Gin Asp Asn lie Gin Trp Pro Thr Gly Leu Ala 
4040 404S 4050 

GTG GAT TAT CAC AAT GAG CGG CTG TAC TGG GCA GAC GCC AAG CTT TCA 12667 
Val Asp Tyr His Asn Glu Arg Leu Tyr Trp Ala Aso Ala Lys Leu Ser 
4055 4060 . ' \ 4065 

GTC ATC GGC AGC ATC CGG CTC AAT GGC ACG GAC CCC ATT GTG GCT GCT 12715 
Val lie Gly Ser lie Arg Leu Asn Gly Thr Asp Pro lie Val Ala Ala 
4070 4075 4080 

GAC AGC AAA CGA GGC CTA AGT CAC CCC TTC AGC ATC GAC GTC TTT GAG : 12763 
Asp Ser Lys Arg Gly Leu Ser His Pro Phe Ser lie Asp Val Phe Glu 
4085 4090 4095 

GAT TAC ATC TAT GGT GTC ACC TAC ATC AAT AAT CGT GTC TTC AAG ATC 12811 
Asp Tyr lie Tyr Gly Val Thr Tyr lie Asn Asn Arg Val Phe Lys He 
4100 4105 4110 4115 

CAT AAG TTT GGC CAC AGC CCC TTG GTC AAC CTG ACA GGG GGC CTG AGC 12859 
His Lys Phe Gly His Ser Pro Leu Val Asn Leu Thr Gly Gly Liu Set 
4120 4125 4130 

CAC GCC TCT GAC GTG GTC CTT TAC CAT CAG CAC AAG CAG CCC GAA GTG 12907 
His Ala Ser Asp Val Val Leu Tyr His Gin His Lys Gin Pro Glu Val 
4135 4140 4145 

ACC AAC CCA TGT GAC CGC AAG AAA TGC GAG TGG CTC TGC CTG CTG AGC 12955 
Thr Asn Pro Cys Asp Arg Lys Lys Cys Glu Trp Leu Cys Leu Leu Ser 
4150 4155 4160 

CCC AGT GGG CCT GTC TGC ACC TGT CCC AAT GGG AAG CGG CTG GAC AAC 13003 
Pro Ser Gly Pro Val Cys Thr Cys Pro Asn Gly Lys Arg Leu Asp Asn 
4165 4170 4175 

GGC ACA TGC GTG CCT GTG CCC TCT CCA ACG CCC CCC CCA GAT GCT CCC 13051 
Gly Thr Cys Val Pro Val Pro Ser Pro Thr Pro Pro Pro Asp Ala Pro 
4180 4185 4190 4195 

CGG CCT GGA ACC TGT AAC CTG CAG TGC TTC AAC GGT GGC AGC TGT TTC 13099 
Arg Pro Gly Thr Cys Asn Leu Gin Cys Phe Asn Gly Gly Ser Cys Phe ' 
4200 4205 4210 

CTC AAT GCA CGG AGG CAG CCC AAG TGC CGC TGC CAA CCC CGC TAC ACG 13147 
Leu Asn Ala Arg Arg Gin Pro Lys Cys Arg Cys Gin Pro Arg Tyr Thr 
4215 4220 4225 

GGT GAC AAG TGT GAA CTG GAC CAG TGC TGG GAG CAC TGT CGC AAT GGG 13195 
Gly Asp Lys Cys Glu Leu Asp Gin Cys Trp Glu His Cys Arg Asn Gly 
4230 4235 4240 

GGC ACC TGT GCT GCC TCC CCC TCT GGC ATG CCC ACG TGC CGG TGC CCC 13243 
Gly Thr Cys Ala Ala Ser Pro Ser Gly Met Pro Thr Cys Arg Cys Pro 
4245 4250 4255 

ACG GGC TTC ACG GGC CCC AAA TGC ACC CAG CAG GTG TGT GCG GGC TAC 13291 
Thr Gly Phe Thr Gly Pro Lys Cys Thr Gin Gin Val Cys Ala Gly Tyr 
4260 4265 4270 4275 
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111 Ala Jin HZ ^° l G ° -S* GTC MC 015 GGC ^ C CAG CCC CAG 13339 

Cys Ala Asn Asn Ser Thr Cys Thr Val Asn Gin Gly Asn Gin Pro Gin 

4280 4285 4290 

IT* £J ^ C f TA ?° 2?° 11° CTG GGC GAC TGC TAC CGG CAG 13387 
Cys Arg Cys Leu Pro Gly Phe Leu Gly Asp Arg Cys Gin Tyr Arg Gin 
429S 4300 4305 

TGC TCT GGC TAC TGT GAG AAC TXT GGC ACA TGC CAG ATG GCT GCT GAT 13435 
Cys Ser Gly Tyr Cys Glu Asn Phe Gly Thr Cys Gin Met Ala Ala Asp 
4310 4315 4320 

GGC TCC CGA CAA TGC CGC TGC ACT GCC TAC TTT GAG GGA TCG AGG TGT 13483 

Arg Gln Cys Arg Cys Thr Ma p he Glu Gly Ser Arg Cys 
4325 4330 4335 

r^f, S*? ^ C 1°° AGC CGC TGT CTC GAA <* G CCC TGT GTG GTC AAC 13531 
a* a* hys Cys Ser Ar ? Cys Leu Glu Gly Ala Cys Val Val Asn 

4340 4345 4350 * 43S5 

AAG CAG ACT GGG GAT GTC ACC TGC AAC TGC ACG GAT GGC CGG GTG GCC 13579 
Lys Gin Ser Gly Asp Val Thr Cys Asn Cys Thr Asp Gly Arg Val Ala 
4360 4365 4370 

CCC AGC TGT CTG ACC TGC GTC GGC CAC TGC AGC AAT GGC GGC TCC TGT 13627 
Pro Ser Cys Leu Thr Cys Val Gly His Cys Ser Asn Gly Gly Ser Cys 
4375 438O 4335 

ACC ATG AAC AGC AAA ATG ATG CCT GAG TGC CAG TGC CCA CCC CAC ATG 13675 
Thr Met Asn Ser Lys Met Met Pro Glu Cys Gin Cys Pro Pro His Met 
4390 4395 4400 

ACA GGG CCC CGG TGT GAG GAG CAC GTC TTC AGC CAG CAG CAG CCA GGA 13723 

„ G i y Pr ° Arg Cys Glu Glu His Val Phe ^er Gin Gin Gin Pro Gly 
4405 4410 4415 

CAT ATA GCC TCC ATC CTA ATC CCT CTG CTG TTG CTG CTG CTG CTG GTT 13771 
His lie Ala Ser lie Leu lie Pro Leu Leu Leu Leu Leu Leu Leu Val 
4420 4425 4430 4435 

CTG GTG GCC GGA GTG GTA TTC TGG TAT AAG CGG CGA GTC CAA GGG GCT 13819 
Leu Val Ala Gly Val Val Phe Trp Tyr Lys Arg Arg Val Gin Gly Ala 
4440 4445 4450 

AAG GGC TTC CAG CAC CAA CGG ATG ACC AAC GGG GCC ATG AAC GTG GAG 13867 
Lys Gly Phe Gin His Gin Arg Met Thr Asn Gly Ala Met Asn Val Glu 
4455 446O 4465 

ATT GGA AAC CCC ACC TAC AAG ATG TAC GAA GGC GGA GAG CCT GAT GAT 13915 
lie Gly Asn Pro Thr Tyr Lys Met Tyr Glu Gly Gly Glu Pro Asp Asp 
4470 4475 4480 

GTG GGA GGC CTA CTG GAC GCT GAC TTT GCC CTG GAC CCT GAC AAG CCC 13963 
Val Gly Gly Leu Leu Asp Ala Asp Phe Ala Leu Asp Pro Asp Lys Pro 
4485 4490 4495 * 

ACC AAC TTC ACC AAC CCC GTG TAT GCC ACA CTC TAC ATG GGG GGC CAT 14011 
Thr Asn Phe Thr Asn Pro Val Tyr Ala Thr Leu Tyr Met Gly Gly His 
4500 4505 4510 4515 
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r?5 «tl l° C CTG GCC AGC ACG GAC GAG ** G CGA GAA CTC CTG 14059 

Gly Ser Arg His Ser Leu Ala Ser Thr Asp Glu Lys Arg Glu Leu Xeu 

4520 4525 4530 

fSSJESSlJEF. CCT GAG GAC GAG ATA GGG GAC GCC TTG G CA TAGGGCCCTG CC 14110 
CCGTCGGACT GCCCCCAGAA AGCCTCCTGC CCCCTGCCGG TGAAGTCCTT CAGTGAGCCC 14170 
Gly Arg Gly Pro Glu Asp Glu lie Gly Asp Pro Leu Ala 
4535* 4540 



CTCCCCAGCC 
CATTTTATAT 
CCTGCCTGCT 
GCTGCACCTC 
TCCCCTCCCT 
CCGCTCCCAC 
CTGGAAGACG 
GGAGGCCACC 
CCCCCCTCCC 
AGACTTCCtC 
GCCACACCCA 
TTTTGCTGAA 
AAAAAA 



AGCCCTTCCC 
GTGAGCGAGC 
CCTTGGCACC 
CTACCCTCCC 
GTATAAGACA 
AGCTTCCTGA 
TGGCTCTGGG 
CCAAACCCCA 
AGACAGGCCC 
TGGTAAACAt 
GGAAGGGAAA 
TTCTTTACAA 



TGGCCCCGCC 
AAGCCGGCAA 
CCCATGCTGC 
ACCAGAACGC 
CTTTGCCAAG 
GGGCTAATTC 
TGAGGTAGGC 
GCCCCAACTC 
TCCCTGTCTC 
TCCTCCAGCC 
GCGGGCAGCC 
CTAAATAACA 



GGATGTATAA 
GCGAGCACAG 
CTTCAGGGAG 
ACCCCACTGG 
GCTCTCCCCT 
TGGGAAGGGA 
GGGAAAGGAT 
CAGGGGCACC 
CAGGGCCCCC 
TCCCCTCCCC 
CCGTTTTGGG 
CAGATATTCr 



ATGTAAAAAT 
TATTATTTCT 
ACAGGCAGGG 
GAGAGCTGGT 
CTCGCCCCAT 
GAGTTCTTTG 
GGAGTGTTTT 
TATGAGATGG 
ACCGAGGTTC 
TGGGGACGCC 
GACGTGAACG 
TATAAATAAA 



GAAGGAATTA 
CCATCCCCTC 
AGGGCTTGGG 
GGTGCAGCCT 
CCCTGCTTGC 
CTGCCCCTGT 
AGTTCTTGGG 
CCATGCTCAA 
CCAGGGCTGG 
AAGGAGGTGG 
TTTTAATAAT 
ATTGTAAAAA 
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2yr His Gin Arg Arg Gin Pro Arg Val Arg Ser His Ala Cys Glu Asn 

470 475 4fi0 

Asp Gin Tyr Gly Lys Pro Gly Gly Cys Ser Asp lie Cys Leu Leu Ala 

485 490 495 

Asn Ser His Lys Ala Arg Thr Cys Arg Cys Arg Ser Gly Phe Ser Leu 

500 505 Sl0 

Gly Ser Asp Gly Lys Ser Cys Lys Lys Pro Glu His Glu Leu Phe Leu 

515 . 520 525 

Val Tyr Gly Lys Gly Arg Pro Gly He He Arg Gly Met Asp Met Gly 

530 535 540 

Ala Lys Val Pro Asp Glu His Met He Pro He Glu Asn Leu Met Asn 
545 550 555 560 

Pro Arg Ala Leu Asp Phe His Ala Glu Thr Gly Phe He Tyr Phe Ala 

565 570 575 

Asp Thr Thr Ser Tyr Leu He Gly Arg Gin Lys He Asp Gly Thr Glu 

580 S85 590 

Arg Glu Thr lie. Leu Lys Asp Gly He His Asn Val Glu Gly Val Ala 

59 S 600 605 

Val Asp Trp Met Gly Asp Asn Leu Tyr Trp Thr Asp Asp Gly Pro Lys 

610 615 620 

Lys Thr He Ser Val Ala Arg Leu Glu Lys Ala A.la Gin Thr Arg Lys 
625 ' 630 635 640 

Thr Leu He Glu Gly Lys Met Thr His Pro Arg Ala He Val Val Asp 

645 650 655 

Pro Leu Asn Gly Trp Met Tyr Trp Thr Asp Trp Glu Glu Asp Pro Lys 

660 665 670 

Asp Ser Arg Arg Gly Arg Leu Glu Arg Ala Trp Met Asp Gly Ser His 

675 680 685 

Arg Asp He Phe Val Thr Ser Lys Thr Val Leu Trp Pro Asn Gly Leu 

690 695 700 

Ser Leu Asp He Pro Ala Gly Arg Leu Tyr Trp Val Asp Ala Phe Tyr 
705 710 715 * 720 

Asp Arg He Glu Thr He Leu Leu Asn Gly Thr Asp Arg Lys He Val 

725 730 735 

Tyr Glu Gly Pro Glu Leu Asn His Ala Phe Gly Leu Cys His His Gly 

740 745 750 

Asn Tyr Leu Phe Trp Thr Glu Tyr Arg Ser Gly Ser Val Tyr Arg Leu 

755 760 765 

Glu Arg Gly Val Gly Gly Ala Pro Pro Thr Val Thr Leu Leu Arg Ser 

770 775 780 

Glu Arg Pro Pro He Phe Glu He Arg Met Tyr Asp Ala Gin Gin Gin 
765 790 795 800 

Gin Val Gly Thr Asn Lys Cys Arg Val Asn Asn Gly Gly Cys Ser Ser 

80S 810 " 815 

Leu Cys Leu Ala Thr Pro Gly Ser Arg Gin Cys Ala Cys Ala Glu Asp 

8 20 825 830 

Gin Val Leu Asp Ala Asp Gly Val Thr Cys Leu Ala Asn Pro Ser Tyr 
835 840 " 845 
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Sla Cys Ser fik'Ara The Cys Pro Pfo As/i Gin Phe Ser CysMla 
Ser Gly Arg .Cys Tie : Pro Xl e \Ser^^Thr 'Sys >Asp r J^»" *«« airrfraU 

D 'Ara . Serf- Aso Gl i i':> rA Ala 'ted)*':'/**** ttl a « 



;Leu Asp Asp^ftsp 




1110 tf*?.^- V/*£lil5 *v ' ' ~ *>fm 
Lys Ala Trp Val Cys Asp Gly Asp Asn Asp Cys Glu Asp Asn Ser'^Asp 
1125 . > /Vi- 1130, • 1135?^ 

Giu Glu Asn Cys Glu Ser Let/ Ala /Cys Arc Pro Pro Ser His £ro^ 

■ 1140 1145. .* * *v 1150 

Ala Asn Ma' Ihr Ser Val Cys Leu Pro Pro Aso Lys Leu Cvs As^ly 

1155 : 1160 ■■ • V - 1165 

Asn Asfi Asa Cys gly Asp. Gly Ser ^£ Siu. gly Glu Leu Cys As£ Gin 

H70 1175 1180 • 

Cys Ser Leu Asn Asn Gly Gly Cys Ser His Asn Cys Ser Val Ala Pro 
185 H90 1195 1200 

Gly Glu Gly He Val Cys Ser Cys Pro Leu Gly Met Glu Leu Gly Pro 

1205 1210 1215 

Asp Asn His Thr Cys Gin He Gin Ser Tyr Cys Ala Lys His Leu Lys 

1220 1225 ' 1230 

Cys Ser Gin Lys Cys Asp Gin Asn Lys Phe Ser Val Lys Cys Ser Cys 

1235 1240 1245 

Tyr Glu Gly Trp Val Leu Glu Pro Asp Gly Glu Ser Cys Arg Ser Leu 

1250 1255 1260 

Asp Pro Phe Lys Pro Phe He He Phe Ser Asn Arg His Glu lie Arg 
265 12*70 1275 1280 

Arg He Asp Leu His Lys Gly Asp Tyr Ser Val Leu Val Pro Gly Leu 

1285 1290 1295 

Arg Asn Thr He Ala Leu Asp Phe His Leu Ser Gin Ser Ala Leu Tyr 

1300 1305 1310 

Trp Thr Asp Val Val Glu Asp Lys He Tyr Arg Gly Lys Leu Leu Asp 

1315 1320 1325 

Asn Gly Ala Leu thr Ser Phe Glu Val Val He Gin Tyr Gly Leu Ala 

1330 1335 1340 

Thr Pro Glu Gly Leu Ala Val Asp Trp He Ala Gly Asn He Tyr Trp 
345 1350 1355 . 1360 

Val Glu Ser Asn Leu Asp Gin He Glu Val Ala Lys Leu Asp Gly Thr 

1365 1370 1375 

Leu Arg Thr Thr Leu Leu Ala Gly Asp He Glu His Pro Arg Ala He 
1380 1385 1390 
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Ala Asp Pro Arg Asp Gly lie Leu Phe Trp Thr Asp Trp Asp Ala 

1400 1405 

Ser 7^n Pr ° Ar9 Ile Glu Ala Ma Ser Met Ser G ^ Ala Gly Arg Arg 

1410 1415 1420 

Thr Val His Arg Glu Thr Gly Ser Gly Gly Trp Pro Asn Gly Leu Thr 
t 7 „ 1430 1435 1440 

Val Asp Tyr Leu Glu Lys Arg Ile Leu Trp Ile Asp Ala Arg Ser Asp 

1445 1450 * 1455 

Ala Ile Tyr Ser Ala Arg Tyr Asp Gly Ser Gly His Met Glu Val Leu 

1460 1465 1470 

Arg Gly His Glu Phe Leu Ser His Pro Phe Ala Val Thr Leu Tyr Gly 
1475 • 1480 1485 

Gly ,?i^ Val Tyr Trp Thr Asp Trp Thr Asn Thr Leu Ala Lys Ala 

1490 1495 1500 

Asn Lys Trp Thr Gly His Asn Val Thr Val Val Gin Arg Thr Asn Thr 
505 1510 1515 1520 

Gin Pro Phe Asp Leu Gin Val Tyr His Pro Ser Arg Gin Pro Met Ala 

1525 1530 " 1535 

Pro Asn Pro Cys Glu Ala Asn Gly Gly Gin Gly Pro Cys Ser His Leu 

1540 1545 1550 

Cys Leu lie Asn Tyr Asn Arg Thr Val Ser Cys Ala Cys Pro His Leu 

1555 1560 1565 

Met Lys Leu His Lys Asp Asn Thr Thr Cys Tyr Glu Phe Lys Lys Phe 
r 1570 1575 1580 

Leu Leu Tyr Ala Arg Gin Met Glu Ile Arg Gly Val Asp Leu Asp Ala 
585 1590 1595 1600 

Pro Tyr Tyr Asn Tyr Ile Ile Ser Phe Thr Val Pro Asp Ile Asp Asn 

1605 1610 1615 

Val Thr Val Leu Asp Tyr Asp Ala Arg Glu Gin Arg Val Tyr Trp Ser 

1620 1625 1630 

Asp Val Arg Thr Gin Ala Ile Lys Arg Ala Phe Ile Asn Gly Thr Gly 

1635 1640 1645 

Val Glu Thr Val Val Ser Ala Asp Leu Pro Asn Ala His Gly Leu Ala 

1650 1655 1660 

Val Asp Trp Val Ser Arg Asn Leu Phe Trp Thr Ser Tyr Asp Thr Asn 
665 1670 1675 ' * 1680 

Lys Lys Gin Ile Asn Val Ala Arg Leu Asp Gly Ser Phe Lys Asn Ala 

1685 1690 1695 * 

Val Val Gin Gly Leu Glu Gin Pro His Gly Leu Val Val His Pro Leu 

1700 1705 1710 

Arg Gly Lys Leu Tyr Trp Thr Asp Gly Asp Asn Ile Ser Met Ala Asn 

1715 1720 1725 

Met Asp Gly Ser Asn Arg Thr Leu Leu Phe Ser Gly Gin Lys Gly Pro 

1730 1735 1740 

Val Gly Leu Ala Ile Asp Phe Pro Glu Ser Lys Leu Tyr Trp Ile Ser 
74 5 1750 1755 1760 

Ser Gly Asn His Thr Ile Asn Arg Cys Asn Leu Asp Gly Ser Gly Leu 

1765 1770 1775 

Glu Val lie Asp Ala Met Arg Ser Gin Leu Gly Lys Ala Thr Ala Leu 

1780 1785 1790 

Ala lie Met Gly Asp Lys Leu Trp Trp Ala Asp Gin Val Ser Glu Lys 

1795 1800 1805 

Met Gly Thr Cys Ser Lys Ala Asp Gly Ser Gly Ser Val Val Leu Arg 

1810 1815 . 1820 

Asn Ser Thr Thr Leu Val Met His Met Lys Val Tyr Asp Glu Ser Ile 
825 1830 1835 1840 

Gin Leu Asp His Lys Gly Thr Asn Pro Cys Ser Val Asn Asn Gly Asp 

1845 • 1850 1855 

Cys Ser Gin Leu Cys Leu Pro Thr Ser Glu Thr Thr Arg Ser Cys Met 
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1860 X865 1870 

Cys Thr Ala Gly Tyr Ser Leu Arg Ser Gly Gin Gin Ala Cys Glu Gly 
18/5 1880 1885 

Val ,«i£ Ser Phe Leu tey Tyr Ser Val His Glu Gly He Arg Gly He 

1890 1895 1900 

Pro Leu Asp Pro Asn Asp Lys Ser Asp Ala Leu Val Pro Val Ser Gly 
905 1910 1915 1920. 

Thr Ser Leu Ala Val Gly He Asp Phe His Ala Glu Asn Asp Thr lie 

1925 1930 1935 

Tyr Trp Val Asp Met Gly Leu Ser Thr He Ser Arg Ala Lys Arg Asp 

1940 1945 i 9 !o 

Gin Thr Trp Arg Glu Asp Val Val Thr Asn Gly He Gly Arg Val Glu 
1955 i960 19 6 5 * 

Giy ,£i^ Ala Val Asp Trp lle Ala Gl y Asn Ile T *P ?hr Gin 

1970 1975 3^900 

Gly Phe Asp Val lie Glu Val Ala Arg Leu Asn Gly Ser Phe Arg Tyr 
985 !990 1995 2000 

Val Val Ile Ser Gin Gly Leu Asp Lys Pro Arg Ala lie Thr Val His 

2005 2010 2015 

Pro Glu Lys Gly Tyr Leu Phe Trp Thr Glu Trp Gly Gin Tyr Pro Arg 

2020 2025 2030 

Ile Glu Arg Ser Arg Leu Asp Gly Thr Glu Arg Val Val Leu Val Asn 

2035 2040 2045 

Val Ser lie Ser Trp Pro Asn Gly lie Ser Val Asp Tyr Gin Asp Gly 

2050 2055 2060 

Lys Leu Tyr Trp Cys Asp Ala Arg Thr Asp Lys lie Glu Arg He Asp 
065 2070 2075 2080 

Leu Glu Thr Gly Glu Asn Arg Glu Val Val Leu Ser Ser Asn Asn Met 

2085 2090 2095 

Asp Met Phe Ser Val Ser Val Phe Glu Asp Phe lie Tyr Trp Ser Asp 

2100 2105 2110 

Arg Thr His Ala Asn Gly Ser Ile Lys Arg Gly Ser Lys Asp Asn Ala 

2115 2120 2125 

Thr Asp Ser Val Pro Leu Arg Thr Gly He Gly Val Gin Leu Lys Asp 

2130 2135 2140 

He Lys Val Phe Asn Arg Asp Arg Gin Lys Gly Thr Asn Val Cys Ala 
145 2150 2155 2160 

Val Ala Asn Gly Gly Cys Gin Gin Leu Cys Leu Tyr Arg Gly Arg Gly 

2165 2170 2175 

Gin Arg Ala Cys Ala Cys Ala His Gly Met Leu Ala Glu Asp Gly Ala 

2180 2185 2190 

Ser Cys Arg Glu Tyr Ala Gly Tyr Leu Leu Tyr Ser Glu Arg Thr Ile 

2195 2200 2205 

Leu Lys Ser lie His Leu Ser Asp Glu Arg Asn Leu Asn Ala Pro Val 

2210 2215 2220 

Gin Pro Phe Glu Asp Pro Glu His Met Lys Asn Val Ile Ala Leu Ala 
225 2230 2235 2240 

Phe Asp Tyr Arg Ala Gly Thr Ser Pro Gly Thr Pro Asn Arg He Phe 

2245 2250 2255 

Phe Ser Asp He His Phe Gly Asn Ile Gin Gin He Asn Asp Asp Gly 

2260 2265 2270 

Ser Arg Arg lie Thr lie Val Glu Asn Val Gly Ser Val Glu Gly Leu 

2275 2280 2285 

Ala Tyr His Arg Gly Trp Asp Thr Leu Tyr Trp Thr Ser Tyr Thr Thr 

2290 2295 2300 

Ser Thr He Thr Arg His Thr Val Asp Gin Thr Arg Pro Gly Ala Phe 
305 2310 2315 2320 

Glu Arg Glu Thr Val Ile Thr Met Ser Gly Asp Asp His Pro Arg Ala 
2325 2330 2335 
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Phe Val Leu Asp Glu Cys Gin Ash Leu Met Phe Trp Thr Asn Trp Asn 

„ „ 2340 2345 23S0 

Glu Gin His Pro Ser lie Met Arg Ala Ala Leu Ser Gly Ala Asn Val 

2355 2360 2365 

Leu Thr Leu He Glu Lys Asp He Arg Thr Pro Asn Gly Leu Ala He 

2370 2375 2380 

Asp His Arg Ala Glu Lys Leu Tyr Phe Ser Asp Ala Thr Leu Asp Lys 
3 ? 5 2390 2395 2400 

He Glu Arg Cys Glu Tyr Asp Gly Ser His Arg Tyr Val He Leu Lys 

2405 2410 2415 

Ser Glu Pro Val His Pro Phe Gly Leu Ala Val Tyr Gly Glu His He 

2420 2425 . 2430 

Phe Trp Thr Asp Trp Val Arg Arg Ala Val Gin Arg Ala Asn Lys His 
2435 2440 2445 

Val J?^ y Ser Asn Met Lys Leu Leu Ar 9 Val Asp He Pro Gin Gin Pro 

2450 2455 2 460 

Met Gly He He Ala Val Ala Asn Asp Thr Asn Ser Cys Glu Leu Ser 
465 2470 2475 2480 

Pro Cys Arg lie Asn Asn Gly Gly Cys Gin Asp Leu Cys Leu Leu Thr 

2485 2490 2495 

His Gin Gly His Val Asn Cys Ser Cys Arg Gly Gly Arg He Leu Gin 

2500 2505 " 2510 

Asp Asp Leu Thr Cys Arg Ala Val Asn Ser Ser Cys Arg Ala Gin Asp 

2515 2520 2525 

Glu Phe Glu Cys Ala Asn Gly Glu Cys He Asn Phe Ser Leu Thr Cys 

2530 2535 2540 

Asp Gly Val Pro His Cys Lys Asp Lys Ser Asp Glu Lys Pro Ser Tyr 
545 2550 2555 2560 

Cys Asn Ser Arg Arg Cys Lys Lys Thr Phe Arg Gin Cys Ser Asn Gly 

2565 2570 2575 

Arg Cys Val Ser Asn Met Leu Trp Cys Asn Gly Ala Asp Asp Cys Gly 

2580 2585 2590 

Asp Gly Ser Asp Glu He Pro Cys Asn Lys Thr Ala Cys Gly Val Gly 

2595 .2600 " 2605 

Glu Phe Arg Cys Arg Asp Gly Thr Cys He Gly Asn Ser Ser Arg Cys 

2610 2615 2620 

Asn Gin Phe Val Asp Cys Glu' Asp Ala Ser Asp Glu Met Asn Cys Ser 
625 2630 2635 2640 

Ala Thr Asp Cys Ser Ser Tyr Phe Arg Leu Gly Val Lys Gly Val Leu 

2645 2650 2655 

Phe Gin Pro Cys Glu Arg Thr Ser Leu Cys Tyr Ala Pro Ser Trp Val 

2660 2665 2670 

Cys Asp Gly Ala Asn Asp Cys Gly Asp Tyr Ser Asp GluArg Asp Cys 

2675 2680 2685 

Pro Gly Val Lys Arg Pro Arg Cys Pro Leu Asn Tyr Phe Ala Cys Pro 

2690 2695 2700 

Ser Gly Arg Cys He Pro Met Ser Trp Thr Cys Asp Lys Glu Asp Asp 
705 2710 2715 2720 

Cys Glu His Gly Glu Asp Glu Thr His Cys Asn Lys Phe Cys Ser Glu 

2725 2730 " 2735 

Ala Gin Phe Glu Cys Gin Asn His Arg Cys He Ser Lys Gin Trp Leu 

2740 2745 2750 

Cys Asp Gly Ser Asp Asp Cys Gly Asp Gly Ser Asp Glu Ala Ala His 

2755 2760 2765 . 

Cys Glu Gly Lys Thr Cys Gly Pro Ser Ser Phe Ser Cys Pro Gly Thr 

2770 2775 2780 

His Val Cys Val Pro Glu Arg Trp Leu Cys Asp Gly Asp Lys Asp Cys 
7^5 . 2790 2795 2800 

Ala Asp Gly Ala Asp Glu Ser He Ala Ala Gly Cys Leu Tyr Asn Ser 
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2805 2810 2815 

Thr Cys Asp Asp Arg Glu Phe Met Cys Gin Asn Arg Gin Cys lie Pro 

2820 2825 2 830 

Lys His Phe Val Cys Asp His Asp Arg Asp Cys Ala Asp Gly Ser Asp 

2835 2840 2845 

Glu Ser Pro Glu Qys Glu Tyr Pro Thr Cys Gly Pro Ser Glu Phe Arg 

2850 2855 '2860 

Cys Ala Asn Gly Arg Cys Leu Ser Ser Arg Glh Trp Glu Cys Asp Gly 
865 2870 2875 " 2880 

Glu Asn Asp Cys His. Asp Gin Ser Asp Glu Ala Pro Lys Asn Pro His 

2885 2890' ~ 2895 

Cys Thr Ser Pro Glu His Lys Cys Asn Ala Ser Ser Gin Phe Leu Cys 

2900 2905 2910 

Ser Ser Gly Arg Cys Val Ala Glu Ala Leu Leu Cys Asn Gly Gin Asp 

2915 2920 2925 

Asp Cys Gly Asp Ser Ser Asp Glu Arg Gly Cys His lie Asn Glu Cys 

2930 2935 2940 

Leu Ser Arg Lys Leu Ser Gly Cys Ser Gin Asp Cys Glu Asp Leu Lys 
945 2950 2955 2960 

He Gly Phe Lys Cys Arg Cys Arg Pro Gly Phe Arg Leu Lys Asp Asp 

2965 2970 2975 

Gly Arg Thr Cys Ala Asp Val Asp Glu Cys Ser Thr Thr Phe Pro Cys 

2980 2985 2990 

Ser Gin Arg Cys He Asn Thr His Gly Ser Tyr Lys Cys Leu Cys Val 

2995 3000 3005 

Glu Gly Tyr Ala Pro Arg Gly Gly Asp Pro His Ser Cys Lys Ala Val 

3010 3015 3020 

Thr Asp Glu Glu Pro Phe Leu lie Phe Ala Asn Arg Tyr Tyr Leu Arg 
025 3030 3035 3040 

Lys Leu Asn Leu Asp Gly Ser Asn Tyr Thr Leu Leu Lys Gin Gly Leu 

3045 3050 3055 

Asn Asn Ala Val Ala Leu Asp Phe Asp Tyr Arg Glu Gin Met He Tyr 

3060 3065 3070 

Trp Thr Asp Val Thr Thr Gin Gly Ser Met He Arg Arg Met His Leu 

3075 3080 3085 

Asn Gly Ser Asn Val Gin Val Leu His Arg Thr Gly Leu Ser Asn Pro 

3090 3095 3100 

Asp Gly Leu Ala Val Asp Trp Val Gly Gly Asn Leu Tyr Trp Cys Asp 
1Q 5 3110 3115 3120 

Lys Gly Arg Asp Thr lie Glu Val Ser Lys Leu Asn Gly Ala Tyr Arg 

3125 3130 3135 

Thr Val Leu Val Ser Ser Gly Leu Arg Glu Pro Arg Ala Leu Val Val 

3140 3145 3150 

Asp Val Gin Asn Gly Tyr Leu Tyr Trp Thr Asp Trp Gly Asp His Ser 

3155 3160 * 3165 

Leu lie Gly Arg He Gly Met Asp Gly Ser Ser Arg Ser Val He Val 

3170 3175 3180 

Asp Thr Lys He Thr Trp Pro Asn Gly Leu Thr Leu Asp Tyr Val Thr 
185 3190 3195 * 3200 

Glu Arg He Tyr Trp Ala Asp Ala Arg Glu Asp Tyr He Glu Phe Ala 

3205 3210 3215 

Ser Leu Asp Gly Ser Asn Arg His Val Val Leu Ser Gin Asp He Pro 

3220 3225 3230 

His He Phe Ala Leu Thr Leu Phe Glu Asp Tyr Val Tyr trp Thr Asp 

3235 3240 3245 

Trp Glu Thr Lys Ser He Asn Arg Ala His Lys Thr Thr Gly Thr Asn 

3250 3255 3260 

Lys Thr Leu Leu He Ser Thr Leu His Arg Pro Met Asp Leu His Val 
265 3270 3275 3280 
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Phe 




His Pro Cys Lys 
3295 



Val 



Asn Asn Gly Gly Cys Ser Asn Leu Cys Leu Leu Ser Pro Glv Glv Glv 

3300 3305 * 3310 

His Lys Cys Ala Cys Pro Thr Asn Phe Tyr Leu Gly Ser Asp Gly Arg 

3315 3320 3325 

Thr Cys Vai Ser Asn Cys thr Ala Ser Gin Phe Val Cys Lys Asn Asp 

3330 3335 3340 

Lys Cys lie Pro Phe Trp Trp Lys Cys Asp Thr Glu Asp Asp Cys Gly 
345 3350 3355 3360 

Asp His Ser Asp Glu Pro Pro Asp Cys Pro Glu Phe Lys Cys Arg Pro 

3365 3370 3375 

Gly Gin Phe Gin Cys Ser thr Gly lie Cys Thr Asn Pro Ala Phe lie 

3380 3385 3390 

Cys Asp Gly Asp Asn Asp Cys Gin Asp Asn Ser Asp Glu Ala Asn Cys. 

3395 3400 3405 

Asp lie His Val Cys Leu Pro Ser Gin Phe Lys Cys Thr Asn Thr Asn 

3410 3415 3420 

Arg Cys lie Pro Gly He Phe Arg Cys Asn Gly Gin Asp Asn Cys Gly 
425 3430 3435 3440 

Asp Gly Glu Asp Glu Arg Asp Cys Pro Glu Val Thr Cys Ala Pro Asn 

3445 3450 3455 

Gin Phe Gin Cys Ser He Thr Lys Arg Cys He Pro Arg Val Trp Val 

3460 34 65 3470 

Cys Asp Arg Asp Asn Asp Cys Val Asp Gly Ser Asp Glu Pro Ala Asn 

3475 3480 3485 

Cys Thr Gin Met Thr Cys Gly Val Asp Glu Phe Arg Cys Lys Asp Ser 

3490 3495 3500 

Gly Arg Cys lie Pro Ala Arg Trp Lys Cys Asp Gly Glu Asp Asp Cys 
505 3510 3515 • 3520 

Gly Asp Gly Ser Asp Glu Pro Lys Glu Glu Cys Asp Glu Arg Thr Cys 

3525 3530 3535 

Glu Pro Tyr Gin Phe Arg Cys Lys Asn Asn Arg Cys Val Pro Gly Arg 

3540 3545 " 3550 

Trp Gin Cys Asp Tyr Asp Asn Asp Cys Gly Asp Asn Ser Asp Glu Glu 

3555 3560 3565 

Ser Cys Thr Pro Arg Pro Cys Ser Glu Ser Glu Phe Ser Cys Ala Asn 

3570 3575 3580 

Gly Arg Cys lie Ala Gly Arg Trp Lys Cys Asp Gly Asp His Asp Cys 
585 3590 3595 3600 

Ala Asp Gly Ser Asp Glu Lys Asp Cys Thr Pro Arg Cys Asp Met Asp 

3605 3610 " 3615 

Gin Phe Gin Cys Lys Ser Gly His Cys He Pro Leu Arg Trp Arg Cys 

3620 3625 3630 

Asp Ala Asp Ala Asp Cys Met Asp Gly Ser Asp Glu Glu Ala Cys Gly 

3635 3640 3645 

Thr Gly Val Arg Thr Cys Px-o Leu Asp Glu Phe Gin Cys Asn Asn .Thr 

3650 3655 . 3660 

Leu Cys Lys Pro Leu Ala Trp Lys Cys Asp Gly Glu Asp Asp Cys Gly 
665 3670 3675 " 3680 

Asp Asn Ser Asp Glu Asn Pro Glu Glu Cys Ala Arg Phe Val Cys Pro 

3685 3690 3695 

Pro Asn Arg P*o Phe Arg Cys Lys Asn Asp Arg Val Cys Leu Trp He 

3700 3705 3710 

Gly Arg Gin Cys Asp Gly Thr Asp Asn Cys Gly Asp Gly Thr Asp Glu 

3715 ( 3720 3725 

Glu Asp Cys Glu Pro Pro Thr Ala His Thr Thr His Cys Lys Asp Lys ■- 

3730 3735 3740 

Lys Glu Phe Leu Cys Arg Asn Gin Arg Cys Leu Ser Ser Ser Leu Arg 
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745 3750 3755 ->7«n 

Cys Asn Met Phe^Asp Asp Cys Gly Asp^ly s" Asp Glu Glu Asp Cys 

Ser lie Asp^ro Lys Leu Thr Ser^ys^U Thr Asn Ala Ser"" Cys 

Gly Asp^lu Ala Arg Cys Val^'iS Glu Lys Ala Ala^yr Cys Ala 

Cys Arg Ser Gly Phe His Thr Vai Pro Gly Gin Pro^y Cys Gin Asp 
■ 50J - U 3815 3820 



lie Asn Glu Cys Leu Arg Phe Gly Thr Cys Ser Gin Leu Cys Asn Asn 

830 3835 3840 

Thr Lys Gly Gly His Leu Cys Ser Cys Ala Arg Asn Phe Met Lys Thr 

3845 3850 3855 

His Asn Thr Cys Lys Ala Glu Gly Ser Glu Tyr Gin Val Leu Tyr He 
JB50 3865 3870 

Ma ASP ^? Asn Glu lle Ar * Ser Leu p *e Pro Gly His Pro His Ser 
oo/:> 3880 3885 

Ala 3?9S" G1U Ma Phe 3 G 9S Gly ASP G1U S " Val Ar 9 Ile Ala 

Met Asp Val His Val Lys Ala Gly Arg Val Tyr^rp Thr Asn Trp His 
„ „w 3910 3915 3920 

Thr Gly Thr He Ser Tyr Arg Ser Leu Pro Pro Ala Ala Pro Pro Thr 
„ „ 3925 393 0 393S 

Thr Ser Asn Arg His Arg Arg Gin He Asp Arg Gly Val Thr His Leu 

3940 3945 3g50 

Asn He Ser Gly Leu Lys Met Pro Arg Gly He Ala He Asp Trp Val 
3955 3960 3965 

Ala ,oiX ASfl V?1 Tyr Trp Thr As P Ser 61 V Ar 9 Asp Val He Glu Val 

3970 397S 398O 

Ala Gin Met Lys Gly Glu Asn Arg Lys Thr Leu He Ser Gly Met He 

? 85 -„, „ . 3990 399S 4000 

Asp Glu Pro His Ala He Val Val Asp Pro Leu Arg Gly Thr Met Tyr 

. 4005 4010 4015 

Trp Ser Asp Trp Gly Asn His Pro Lys He Glu Thr Ala Ala Met Asp 

4020 4025 4030 

Gly Thr Leu Arg Glu Thr Leu Val Gin Asp Asn He Gin Trp Pro Thr 

4035 4040 4045 

y Al« Ala Val Asp Tyr His Asn Glu Ar 9 Leu Tyr Trp Ala Asp Ala 
*Oi>0 4055 406O 

Lys Leu Ser Val He Gly Ser He Arg Leu Asn Gly Thr Asp Pro He 
065 4070 4075 4080 

Val Ala Ala Asp Ser Lys Arg Gly Leu Ser His Pro Phe Ser He Asp 

4085 4090 4095 

Val Phe Glu Asp Tyr He Tyr Gly Val Thr Tyr He Asn Asn Arg Val 

4100 4105 4110 

Phe Lys Ile His . Lys Phe Gly His Ser Pro Leu Val Asn Leu Thr Gly 
4115 4120 4125 

y ™ Ser H±S Ala Ser Val Val Leu Gin His Lys Gin 

4130 4135 4140 

Pro Glu Val Thr Asn Pro Cys Asp Arg Lys Lys Cys Glu Trp Leu Cys 

" 5 4150 4155 4160 

Leu Leu Ser Pro Ser Gly Pro Val Cys Thr Cys Pro Asn Gly Lys Arg 

4165 4170 4175 

Leu Asp Asn Gly Thr Cys Val Pro Val Pro Ser Pro Thr Pro Pro Pro 

4180 4185 4190 

Asp Ala Pro Arg Pro Gly Thr Cys Asn Leu Gin Cys Phe Asn Gly Gly 

4195 . 4200 4205 

Ser Cys Phe Leu Asn Ala Arg Arg Gin Pro Lys Cys Arg Cys Gin Pro 

4210 4215 4220 



FIGc 14B 



WO 01/92474 



91/91 



PCT/US01/18041 



Arg Tyr Thr Gly Asp Lys Cys Glu Leu Asp Gin Cys Trp Glu His Cys 

4230 4235 4240 

Arg Asn Gly Gly Thr Cys Ala Ala Ser Pro Ser Gly Met Pro Thr Cys 

4245 4250 4255 

Arg Cys Pro Thr Gly Phe Thr Gly Pro Lys Cys Thr Gin Gin Val Cys 

4260 4265 4270 

Ala Gly Tyr Cys Ala Asn Asn Ser Thr Cys Thr Val Asn Gin Gly Asn 

4275 4280 4285 

Gln 4290 ° yS 4295 ^ GlY **** ^ Gly Arg CyS Gln 

Tyr Arg Gln Cys Ser Gly Tyr Cys Glu Asn Phe Gly Thr Cys Gln Met 
305 4310 4^315 4320 

Ala Ala Asp Gly Ser Arg Gln Cys Arg Cys Thr Ala Tyr Phe Glu Gly 

4325 4330 4335 

Ser Arg Cys Glu Val Asn Lys Cys Ser Arg Cys Leu Glu Gly Ala Cys 

434 0 4345 4?50 

Val Val Asn Lys Gln Ser Gly Asp Val Thr Cys Asn Cys Thr Asp Gly 

4355 4360 4365 

Arg Val Ala Pro Ser Cys Leu thr Cys Val Gly His Cys Ser Asn Glv 

4370 4375 4380 

Gly Ser Cys Thr Met Asn Ser Lys Met Met Pro Glu Cys Gln Cys Pro 
385 4390 . ' 4395 4400 

Pro His Met Thr Gly Pro Arg Cys Glu Glu His Val Phe Ser Gln Gln 

4405 4410 4415 

Gln Pro Gly His lie Ala Ser lie Leu lie Pro Leu Leu Leu Leu Leu 

4420 4425 4430 

Leu Leu Val Leu Val Ala Gly Val Val Phe Trp Tyr Lys Arg Arg Val 

4435 . 4440 4445 

Gln Gly Ala Lys Gly Phe Gln His Gln Arg Met Thr Asn Gly Ala Met 

4450 4 455 44 60 

Asn Val Glu lie Gly Asn Pro Thr Tyr Lys Met Tyr Glu Gly Gly Glu 
465 4470 4475 4480 

Pro Asp Asp Val Gly Gly Leu Leu Asp Ala Asp Phe Ala Leu Asp Pro 

4485 4490 4495 

Asp Lys Pro Thr Asn Phe Thr Asn Pro Val Tyr Ala Thr Leu Tyr Met 

4500 4505 4510 

Gly Gly His Gly Ser Arg His Ser Leu Ala Ser Thr Asp Glu Lys Arg 

4515 4520 4525 

Glu Leu Leu Gly Arg Gly Pro Glu Asp Glu lie Gly Asp Pro Leu Ala 
4530 4535 4540 
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SEQUENCE LISTING 



<110> Antigenics* Inc. 

<120> ALPHA (2) MACROGLOBULIN RECEPTOR AS A HEAT SHOCK 
PROTEIN RECEPTOR AND USES THEREOF 

<130> 8449-134-228 

<140> 
<141> 

<150> 09/750,972 
<151> 2000-12-28 

<150> 09/668,724 
<151> 2000-09-22 

<150> 60/209,095 
<151> 2000-06/02 

<160> 57 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 14849 

<212> DNA 

<213> Mus musculus 

<400> 1 

cgctgctccc cgccagtgca ctgaggaggc ggaaacgggg gagcccctag tgctccatca 
ggcccctacc aaggcacccc catcgggtcc acgcccccca ccccccaccc cgcctcctcc 
caattgtgca tttttgcagc cggagtcggc tccgagatgg ggctgtgagc ttcgccctgg 
gagggggaga ggagcgagga gtaaagcagg ggtgaagggt tcgaatttgg gggcaggggg 
cgcacccgcg tcagcaggcc cttcccaggg ggctcggaac tgtaccattt cacctatgcc 
cctggttcgc tttgcttaag gaaggataag atagaagagt cggggagagg aagataaagg 
gggacccccc aattgggggg ggcgaggaca agaagtaaca ggaccagagg gtgggggctg 
ctgtttgcat cggcccacac catgctgacc ccgccgttgc tgctgctcgt gccgctgctt 
tcagctctgg tctccggggc cactatggat gcccctaaaa cttgcagccc taagcagttt 
gcctgcagag accaaatcac ctgtatctca aagggctggc ggtgtgacgg tgaaagagat 
tgccccgacg gctctgatga agcccctgag atctgtccac agagtaaagc ccagagatgc 
ccgccaaatg agcacagttg tctggggact gagctatgtg tccccatgtc tcgtctctgc 
aacgggatcc aggaetgcat ggatggctca gacgagggtg ctcactgccg agagctccga 
gccaactgtt ctcgaatggg ttgtcaacac cattgtgtac ctacacccag tgggcccacg 
tgctactgta acagcagctt ccagctcgag gcagatggca agacgtgcaa agattttgac 
gagtgttccg tgtatggcac ctgcagccag ctttgcacca acacagatgg ctccttcaca 
tgtggctgtg ttgaaggcta cctgctgcaa ccggacaacc gctcctgcaa ggccaagaat 
gagccagtag atcggccgcc agtgctactg attgccaact ctcagaacat cctagctacg 
tacctgagtg gggcccaagt gtctaccatc acacccacca gcacccgaca aaccacggcc 
atggacttca gttatgccaa tgagaccgta tgctgggtgc acgttgggga cagtgctgcc 
cagacacagc tcaagtgtgc ccggatgcct ggcctgaagg gctttgtgga tgagcatacc 
atcaacatct ccctcagcct gcaccacgtg gagcagatgg caatcgactg gctgacggga 
aacttctact ttgtcgacga cattgacgac aggatctttg tctgtaaccg aaacggggac 
acctgtgtca ctctgctgga cctggaactc tacaacccca aaggcatcgc cttggacccc 
gccatgggga aggtgttctt cactgactac gggcagatcc caaaggtgga gcgctgtgac 
atggatggac agaaccgcac caagctggtg gatagcaaga tcgtgtttcc acacggcatc 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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accctggacc tggtcagccg cctcgtctac tgggcggacg cctacctaga ctacatcgag 1620 

gtggtagact acgaagggaa gggtcggcag accatcatcc aaggcatcct gatcgagcac 1680 

ctgtacggce tgaccgtgtt tgagaactat ctctacgcca ccaactcgga caatgccaac 1740 

acgcagcaga agacgagcgt gatccgagtg aaccggttca acagtactga gtaccaggtc 1800 

gtcacccgtg tggacaaggg tggtgccctg catatctacc accagcgacg ccagccccga 1860 

gtgcggagtc acgcctgtga gaatgaccag tacgggaagc caggtggctg ctccgacatc 1920 

tgcctcctgg ccaacagtca caaggcaagg acctgcaggt gcaggtctgg cttcagcctg 1980 

ggaagtgatg ggaagtcttg taagaaacct gaacatgagc tgttcctcgt gtatggcaag 2040 

ggccgaccag gcatcattag aggcatggac atgggggcca aggtcccaga tgagcacatg 2100 

atccccatcg agaaccttat gaatccacgc gctctggact tccacgccga gaccggcttc 2160 

atctactttg ctgacaccac cagctacctc attggccgcc agaaaattga tggcacggag 2220 

agagagacta tcctgaagga tggcatccac aatgtggagg gcgtagccgt ggactggatg 2280 
ggagacaatc tttactggac tgatgatggc cccaagaaga ccattagtgt ggccaggctg . 2340 

gagaaagccg ctcagacccg gaagactcta attgagggca agat'gacaca ccccagggcc 24 00 

attgtagtgg atccactcaa tgggtggatg tactggacag actgggagga ggaccccaag 2460 

gacagtcggc gagggcggct cgagagggct tggatggacg gctcacaccg agatatcttt 2520 

gtcacctcca agacagtgct ttggcccaat gggctaagcc tggatatccc agccggacgc 2580 

ctctactggg tggatgcctt ctatgaccga attgagacca tactgctcaa tggcacagac 2640 

cggaagattg tatatgaggg tcctgaactg aatcatgcct tcggcctgtg tcaccatggc 2700 

aactacctct tttggaccga gtaccggagc ggcagcgtct accgcttgga acggggcgtg 2760 

gcaggcgcac cgcccactgt gacccttctg cgcagcgaga gaccgcctat ctttgagatc 2820 

cgaatgtacg acgcgcacga gcagcaagtg ggtaccaaca aatgccgggt aaataacgga 2880 

ggctgcagca gcctgtgcct cgccaccccc gggagccgcc agtgtgcctg tgccgaggac 2940 

caggtgttgg acacagatgg tgtcacctgc ttggcgaacc catcctacgt gcccccaccc 3000 

cagtgccagc cgggccagtt tgcctgtgcc aacaaccgct gcatccagga gcgctggaag 3060 

tgtgacggag acaacgactg tctggacaac agcgatgagg ccccagcact gtgccatcaa 3120 

cacacctgtc cctcggaccg attcaagtgt gagaacaacc ggtgtatccc caaccgctgg 3180 

ctctgtgatg gggataatga ttgtggcaac agcgaggacg aatccaatgc cacgtgctca 3240 

gcccgcacct gtccacccaa ccagttctcc tgtgccagtg gccgatgcat tcctatctca 3300 

tggacctgtg atctggatga tgactgtggg gaccggtccg atgagtcagc ctcatgcgcc 33 60 

taccccacct gcttccccct gactcaattt acctgcaaca atggcagatg tattaacatc 3420 

aactggcggt gtgacaacga caatgactgt ggggacaaca gcgacgaagc cggctgcagt 3480 

cactcctgct ccagtaccca gttcaagtgc aacagtggca gatgcatccc cgagcactgg 3540 

acgtgtgatg gggacaatga ttgtggggac tacagcgacg agacacacgc caactgtacc 3600 

aaccaggcta caagacctcc tggtggctgc cactcggatg agttccagtg cccgctagat 3660 

ggcctgtgca tccccctgag gtggcgctgc gacggggaca ccgactgcat ggattccagc 3720 

gatgagaaga gctgtgaggg cgtgacccat gtttgtgacc cgaatgtcaa gtttggctgc 3780 

aaggactccg cccggtgcat cagcaaggcg tgggtgtgtg atggcgacag cgactgtgaa 3840 

gataactccg acgaggagaa ctgtgaggcc ctggcctgca ggccaccctc ccatccctgc 3900 

gccaacaaca cctctgtctg cctgcctcct gacaagctgt gcgacggcaa ggatgactgt 3960 

ggagacggct cggatgaggg cgagctctgt gaccagtgtt ctctgaataa tggtggctgt 4020 

agtcacaact gctcagtggc ccctggtgaa ggcatcgtgt gctcttgccc tctgggcatg 4080 

gagctgggct ctgacaacca cacctgccag atccagagct actgtgccaa gcacctcaaa 4140 

tgcagccaga agtgtgacca gaacaagttc agtgtgaagt gctcctgcta cgagggctgg 4200 

gtcttggagc ctgacgggga aacgtgccgc agtctggatc ccttcaaact gttcatcatc 4260 

ttctccaacc gccacgagat caggcgcatt gaccttcaca agggggacta cagcgtccta 4320 

gtgcctggcc tgcgcaacac tattgccctg gacttccacc tcagccagag tgccctctac 4380 

tggaccgacg cggtagagga caagatctac cgtgggaaac tcctggacaa cggagccctg 4440 

accagctttg aggtggtgat tcagtatggc ttggccacac cagagggcct ggctgtagat 4500 

tggattgcag gcaacatcta ctgggtggag agcaacctgg accagatcga agtggccaag 4560 

ctggacggaa ccctccgaac cactctgctg gcgggtgaca ttgagcaccc gagggccatc 4620 

gctctggacc ctcgggatgg gattctgttt tggacagact gggatgccag cctgccacga 4680 

atcgaggctg catccatgag tggagctggc cgccgaacca tccaccggga gacaggctct 4740 

gg999 c tgcg ccaatgggct caccgtggat tacctggaga agcgcatcct ctggattgat 4800 

gctaggtcag atgccatcta ttcagcccgg tatgacggct ccggccacat ggaggtgctt 4860 

cggggacacg agttcctgtc acacccattt gccgtgacac tgtacggtgg ggaggtgtac 4920 

tggaccgact ggcgaacaaa tacactggct aaggccaaca agtggactgg ccacaacgtc 4980 

accgtggtac agaggaccaa cacccagccc ttcgacctgc aggtgtatca cccttcccgg 5040 
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cagcccatgg ctccaaaccc atgtgaggcc aatggcggcc ggggcccctg ttcccatctg 5100 

tgcctcatca actacaaccg gaccgtctcc tgggcctgtc cccacctcat gaagctgcac 5160 

aaggacaaca ccacctgcta tgagtttaag aagttcctgc tgtacgcacg tcagatggag 5220 

atccggggcg tggacctgga tgccccgtac tacaattata tcatctcctt cacggtgcct 5280 

gatatcgaca atgtcacggt gctggactat gatgcccgag agcagcgagt ttactggtct 5340 

gatgtgcgga ctcaagccat caaaagggca tttatcaacg gcactggcgt ggagaccgtt 5400 

gtctctgcag acttgcccaa cgcccacggg ctggctgtgg actgggtctc ccgaaatctg 5460 

ttttggacaa gttacgacac caacaagaag cagattaacg tggcccggct ggacggctcc 5520 

ttcaagaatg cggtggtgca gggcctggag cagccccacg gcctggtcgt ccacccgctt 5580 

cgtggcaagc tctactggac tgatggggac aacatcagca tggccaacat ggatgggagc 5640 

aaccacactc tgctcttcag tggccagaag ggccctgtgg ggttggccat tgacttccct 5700 

gagagcaaac tctactggat cagctctggg aaccacacaa tcaaccgttg caatctggat 5760 

gggagcgagc tggaggtcat cgacaccatg cggagccagc tgggcaaggc cactgccctg 5820 

gccatcatgg gggacaagct gtggtgggca gatcaggtgt cagagaagat gggcacgtgc 5880 

aacaaagccg atggctctgg gtccgtggtg ctgcggaaca gtaccacgtt ggttatgcac 5940 

atgaaggtgt atgacgagag catccagcta gagcatgagg gcaccaaccc ctgcagtgtc 6000 

aacaacggag actgttccca gctctgcctg ccaacatcag agacgactcg ctcctgtatg 6060 

tgtacagccg gttacagcct ccggagcgga cagcaggcct gtgagggtgt gggctctttt 6120 

ctcctgtact ctgtacatga gggaattcgg gggattccac tagatcccaa tgacaagtcg 6180 

gatgccctgg tcccagtgtc cggaacttca ctggctgtcg gaatcgactt ccatgccgaa 6240 

aatgacacta tttattgggt ggatatgggc ctaagcacca tcagcagggc caagcgtgac 63 00 

cagacatggc gagaggatgt ggtgaccaac ggtattggcc gtgtggaggg catcgccgtg 6360 

gactggatcg caggcaacat atactggacg gaccagggct tcgatgtcat cgaggttgcc 6420 

cggctcaatg gctcttttcg ttatgtggtc atttcccagg gtctggacaa gcctcgggcc 6480 

atcactgtcc acccagagaa ggggtacttg ttctggaccg agtggggtca ttacccacgt 6540 

attgagcggt ctcgccttga tggcacagag agagtggtgt tggttaatgt cagcatcagc 6600 

tggcccaatg gcatctcagt agactatcag ggcggcaagc tctactggtg tgatgctcgg 6660 

atggacaaga tcgagcgcat cgacctggaa acgggcgaga accgggaggt ggtcctgtcc 6720 

agcaataaca tggatatgtt ctccgtgtcc gtgtttgagg acttcatcta ctggagtgac 6780 

agaactcacg ccaatggctc catcaagcgc ggctgcaaag acaatgctac agactccgtg 6840 

cctctgagga caggcattgg tgttcagctt aaagacatca aggtcttcaa cagggacagg 6900 

cagaagggta ccaatgtgtg cgcggtagcc aacggcgggt gccagcagct ctgcttgtat 6960 

cggggtggcg gacagcgagc ctgtgcctgt gcccacggga tgctggcaga agacggggcc 7020 

tcatgccgag agtacgctgg ctacctgctc tactcagagc ggaccatcct caagagcatc 7080 

cacctgtcgg atgagcgtaa cctcaacgca ccggtgcagc cctttgaaga ccccgagcac 714 0 

atgaaaaatg tcatcgccct ggcctttgac taccgagcag gcacctcccc ggggacccct 72 00 

aaccgcatct tcttcagtga catccacttt gggaacatcc agcagatcaa tgacgatggc 7260 

tcgggcagga ccaccatcgt ggaaaatgtg ggctctgtgg aaggectggc ctatcaccgt 7320 

ggctgggaca cactgtactg gacaagctac accacatcca ccatcacccg ccacaccgtg 7380 

gaccagactc gcccaggggc cttcgagagg gagacagtca tcaccatgtc cggagacgac 7440 

cacccgagag cctttgtgct ggatgagtgc cagaacctga tgttctggac caattggaac 7500 

gagctccatc caagcatcat gcgggcagcc ctatccggag ccaacgtcct gaccctcatt 7560 

gagaaggaca tccgcacgcc caatgggttg gccatcgacc accgggcgga gaagctgtac 7620 

ttctcggatg ccaccttgga caagatcgag cgctgcgagt acgacggctc ccaccgctat 7680 

gtgatcctaa agtcggagcc cgtccacccc tttgggttgg cggtgtacgg agagcacatt 7740 

ttctggactg actgggtgcg gcgggctgtg cagcgagcca acaagtatgt gggcagcgac 7800 

atgaagctgc ttcgggtgga cattccccag caacccatgg gcatcafecgc cgtggccaat 7860 

gacaccaaca gctgtgaact ctccccctgc cgtatcaaca atggaggctg ccaggatctg 7920 

tgtctgctca cccaccaagg ccacgtcaac tgttcctgtc gagggggccg gatcctccag 7980 

gaggacttca cctgccgggc tgtgaactcc tcttgtcggg cacaagatga gtttgagtgt 8040 

gccaatgggg aatgtatcag cttcagcctc acctgtgatg gcgtctccca ctgcaaggac 8100 

aagtccgatg agaagccctc ctactgcaac tcacgccgct gcaagaagac tttccgccag 8160 

tgtaacaatg gccgctgtgt atccaacatg ctgtggtgca atggggtgga ttactgtggg 8220 

gatggctctg atgagatacc ttgcaacaag actgcctgtg gtgtgggtga gttccgctgc 8280 

cgggatgggt cctgcatcgg gaactccagt cgctgcaacc agtttgtgga ttgtgaggat 8340 

gcctcggatg agatgaattg cagtgccaca gactgcagca gctatttccg cctgggcgtg 8400 

aaaggtgtcc tcttccagcc gtgcgagcgg acatccctgt gctacgcacc tagctgggtg 8460 

tgtgatggcg ceaacgactg tggagactac agcgatgaac gtgactgtcc aggtgtgaag 8520 
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cgccctaggt gcccgctcaa ttactttgcc tgccccagcg ggcgctgtat ccccatgagc 8580 

tggacgtgtg acaaggagga tgactgtgag aacggcgagg atgagaccca ctgcaacaag 8640 

ttctgctcag aggcacagtt cgagtgccag aaccaccggt gtatctccaa gcagtggctg 8700 

tgtgacggta gcgatgattg cggggatggc tccgatgagg cagctcactg tgaaggcaag 8760 

acatgtggcc cctcctcctt ctcctgtccc ggcacccacg tgtgtgtccc tgagcgctgg 8820 

ctctgtgatg gcgacaagga ctgtaccgat ggcgcggatg agagtgtcac tgctggctgc 8880 

ctgtacaaca gcacctgtga tgaccgtgag ttcatgtgcc agaaccgctt gtgtattccc 8940 

aagcatttcg tgtgcgacca tgaccgtgac tgtgctgatg gctctgatga atcccctgag 9000 

tgtgagtacc caacctgcgg gcccaatgaa ttccgctgtg ccaatgggcg ttgtctgagc 9060 

tcccgtcagt gggaatgtga tggggagaat gactgtcacg accacagcga tgaggctccc 9120 

aagaacccac actgcaccag cccagagcac aaatgcaatg cctcatcaca gttcctgtgc 9180 

agcagcgggc gctgcgtggc tgaggcgttg ctctgcaacg gccaggacga ctgtggggac 9240 

ggttcagacg aacgcgggtg ccatgtcaac gagtgtctca gccgcaagct cagtggctgc 9300 

agtcaggact gcgaggacct caagataggc tttaagtgcc gctgtcgccc gggcttccgg 9360 

ctaaaggacg atggcaggac ctgtgccgac ctggatgagt gcagcaccac cttcccctgc 9420 

agccagctct gcatcaacac ccacggaagt tacaagtgtc tgtgtgtgga gggctatgca 9480 

ccccgtggcg gtgaccccca cagctgcaaa gctgtgaccg atgaggagcc atttctcatc 9540 

tttgccaacc ggtactacct gcggaagctc aacctggacg gctccaacta cacactgctt 9600 

aagcagggcc tgaacaatgc ggtcgccttg gcatttgact accgagagca gatgatctac 9660 

tggacgggcg tgaccaccca gggcagcatg attcgcagga tgcacctcaa cggcagcaac 9720 

gtgcaggttc tgcaccggac gggccttagt aacccagatg ggctcgctgt ggactgggtg 9780 

ggtggcaacc tgtactggtg tgacaagggc agagatacca ttgaggtgtc caagcttaac 9840 

ggggcctatc ggacagtgct ggtcagctct ggcctccggg agcccagagc tctggtagtg 9900 

gatgtacaga atgggtacct gtactggaca gactggggtg accactcact gatcggccgg 9960 

attggcatgg atggatctgg ccgcagcatc atcgtggaca ctaagatcac atggcccaat 10020 

ggcctgaccg tggactacgt cacggaacgc atctactggg ctgacgcccg tgaggactac 10080 

atcgagttcg ccagcctgga tggctccaac cgtcacgttg tgctgagcca agacatccca 10140 

cacatctttg cgctgaccct atttgaagac tacgtctact ggacagactg ggaaacgaag 10200 

tccatcaacc gggcccacaa gaccacgggt gccaacaaaa cactcctcat cagcaccctg 10260 

caccggccca tggacttaca tgtattccac gccctgcgcc agccagatgt gcccaatcac 10320 

ccctgcaaag tcaacaatgg tggctgcagc aacctgtgcc tgctgtcccc tgggggtggt 10380 

cacaagtgcg cctgccccac caacttctat ctgggtggcg atggccgtac ctgtgtgtcc 10440 

aactgcacag caagccagtt tgtgtgcaaa aatgacaagt gcatcccctt ctggtggaag 10500 

tgtgacacgg aggacgactg tggggatcac tcagacgagc ctccagactg tcccgagttc 10560 

aagtgccgcc caggccagtt ccagtgctcc accggcatct gcaccaaccc tgccttcatc 10620 

tgtgatgggg acaatgactg ccaagacaat agtgacgagg ccaattgcga cattcacgtc 10680 

tgcttgccca gccaattcaa gtgcaccaac accaaccgct gcattcctgg catcttccgt 10740 

tgcaatgggc aggaeaactg cggggacggc gaggatgagc gggattgccc tgaggtgacc 10800 

tgcgccccca accagttcca gtgctccatc accaagcgct gcatccctcg cgtctgggtc 10860 

tgtgacaggg ataatcactg tgtggacggc agtgatgagc ctgccaactg tacccaaatg 10920 

acctgtggag tggatgagtt ccgctgcaag gattctggcc gctgcatccc cgcgcgctgg 10980 

aagtgtgacg gagaagatga ctgtggggat ggttcagatg agcccaagga agagtgtgat 11040 

gagcgcacct gtgagccata ccagttccgc tgcaaaaaca accgctgtgt cccaggccgt 11100 

tggcaatgtg actacgacaa cgactgcgga gataactcgg acgaggagag ctgcacacct 11160 

cggccctgct ctgagagtga gtttttctgt gccaatggcc gctgcatcgc tgggcgctgg 11220 

aagtgtgatg gggaccatga ctgtgccgac ggctcagacg agaaagactg caccccccgc 11280 

tgtgatatgg accagttcca gtgcaagagt ggccactgca tccccctgcg ctggccgtgt 11340 

gacgcggatg ctgactgtat ggacggcagt gacgaggaag cctgtggcac tggggtgagg 11400 

acctgcccat tggatgagtt tcaatgtaac aacaccttgt gcaagccgct ggcctggaag 11460 

tgtgatggag aggacgactg tggggacaac tcagatgaga accccgagga atgcgcccgg 11520 

ttcatctgcc ctcccaaccg gcctttccgc tgcaagaatg accgagtctg cctgtggatt 11580 

gggcgccagt gtgatggcgt ggacaactgt ggagatggga ctgacgagga ggactgtgag 11640 

ccccccacgg cccagaaccc ccactgcaaa gacaagaagg agttcctgtg ccgaaaccag 11700 

cgctgtctat catcctccct gcgctgtaac atgttcgatg actgcggcga tggctccgat 11760 

gaagaagatt gcagcatcga ccccaagctg accagctgtg ccaccaatgc cagcatgtgt 11820 

ggggacgaag ctcgttgtgt gcgcactgag aaagctgcct actgtgcctg ccgctcgggc 11880- 

ttccatactg tgccgggcca gcccggatgc caggacatca acgagtgcct gcgctttggt 11940 

acctgctctc agctctggaa caaacccaag ggaggccacc tctgcagctg tgcccgcaac 12000 
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ttcatgaaga cacacaacac ctgcaaagct gaaggctccg agtaccaggt gctatacatc 12060 

gcggatgaca acgagatccg cagcttgttc ccgggccacc cccactcagc ctacgagcag 12120 

acattccagg gcgatgagag tgtccgcata gatgccatgg atgtccatgt caaggccggc 12180 

cgtgtctact ggactaactg gcacacgggc acaatctcct acaggagcct gccccctgcc 12240 

gcccctccta ccacttccaa ccgccaccgg aggcagatcg accggggtgt cacccacctc 12300 

aatatttcag ggctgaagat gccgaggggt atcgctatcg actgggtggc cgggaatgtg 12360 

tactggaccg attccggccg agacgtgatt gaggtggcgc aaatgaaggg cgagaaccgc 12420 
aagacgctca tctcgggcat gattgatgag ccccatgcca tcgtggtgga ccctctgagg ■ 12480 

ggcaccatgt actggtcaga ctgggggaac caccccaaga ttgaaacagc agcgatggat 12540 

ggcacccttc gggagactct cgtgcaagac aacattcagt ggcctacagg gctggctgtg 12600 

gactatcaca atgaacggct ctactgggca gatgccaagc tttcggtcat cggcagcatc 12660 

cggctcaacg gcactgaccc cattgtggct gctgacagca aacgaggcct aagtcacccc 12720 

ttcagcatcg atgtgtttga agactacatc tacggagtca cttacatcaa taatcgtgtc 12780 

ttcaagatcc acaagtttgg acacagcccc ttgtacaacc taactggggg cctgagccat 12 840 

gcctctgatg tagtccttta ccatcaacac aagcagcctg aagtgaccaa cccctgtgac 12900 

cgcaagaaat gcgaatggct gtgtctgctg agccccagcg ggcctgtctg cacctgtcce 12960 

aatggaaaga ggctggataa tggcacctgt gtgcctgtgc cctctccaac accccctcca 13020 

gatgccccta ggcctggaac ctgcactctg cagtgcttca atggtggtag ttgtttcctc 13080 

aacgctcgga ggcagcccaa gtgccgttgc cagccecgtt acacaggcga taagtgtgag 13140 

ctggatcagt gctgggaata ctgtcacaac ggaggcacct gtgcggcttc cccatctggc 13200 

atgcccacgt gccgctgtcc cactggcttc acgggcccca aatgcaccgc acaggtgtgt 13260 

gcaggctact gctctaacaa cagcacctgc accgtcaacc agggcaacca gccccagtgc 13320 

cgatgtctac ctggcttcct gggcgaccgt tgccagtacc ggcagtgctc tggcttctgt 13380 

gagaactttg gcacctgtca gatggctgct gatggctccc gacaatgtcg ctgcaccgtc 13440 

tactttgagg gaccaaggtg tgaggtgaac aagtgtagtc gctgtctcca aggcgcctgt 13500 

gtggtcaata agcagaccgg agatgtcaca tgcaactgca ctgatggecg ggtagccccc 13560 

agttgtctca cctgcatcga tcactgtagc aatggtggct cctgcaccat gaacagcaag 13620 

atgatgcctg agtgccagtg cccgccccat atgacaggac cccggtgcca ggagcaggtt 13680 

gttagtcagc aacagcctgg gcatatggcc tccatcctga tccctctgct gctgcttctc 13740 

ctgctgcttc tggtggctgg cgtggtgttc tggtataagc ggcgagtccg aggggctaag 13800 

ggcttccagc accagcggat gaccaatggg gccatgaatg tggaaattgg aaaccctacc 13 860 

tacaagatgt atgaaggtgg agagcccgat gatgtcgggg gcctactgga tgctgatttt 13 920 

gcccttgacc ctgacaagcc taccaacttc accaacccag tgtatgccac gctctacatg 13980 

gggggccacg gcagccgcca ttccctggcc agcacggacg agaagcgaga actgctgggc 14040 

cggggacctg aagacgagat aggagatccc ttggcatagg gccctgcccc gacggatgtc 14100 

cccagaaagc cccctgccac atgagtcttt caatgaaccc cctccccagc cggcccttct 14160 

ccggccctgc cgggtgtaca aatgtaaaaa tgaaggaatt actttttata tgtgagcgag 14220 

caagcgagca agcacagtat tatctctttg catttccttc ctgcctgctc ctcagtatcc 14280 

cccccatgct gccttgaggg ggcggggagg gctttgtggc tcaaaggtat gaaggagtcc 14340 

acatgttccc taccgagcat acccctggaa gcctggcggc acggcctccc caccacgcct 14400 

gtgcaagaca ctcaacgggg ctccgtgtcc cagctttcct ttccttggct ctctggggtt 14460 

agttcagggg aggtggagtc ctctgctgac ectgtctgga agatttggct ctagctgagg 14520 

aaggagtctt ttagttgagg gaagtcaccc caaaccccag ctcccacttt caggggcacc 14580 

tctcagatgg ccatgctcag tatcccttcc agacaggccc tcccctctct agcgccccct 14640 

ctgtggctcc tagggctgaa cacattcttt ggtaactgtc ccccaagcct cccatccccc 14700 

tgagggccag gaagagtcgg ggcacaccaa ggaagggcaa gcgggcagcc ccattttggg 14760 

gacgtgaacg ttttaataat ttttgctgaa ttcctttaca actaaataac acagatattg 14820 

ttataaataa aattgtaaaa aaaaaaaaa 14849 
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Leu 


Thr 


Gly 

■ 2 


Asn Phe Tyr 
295 


Phe 


He 


Phe 


Val 


Cys 


Asn 


Arg Asn Gly 


Asp 


305 










310 




Leu 


Glu 


Leu 


Tyr 


Asn 
325 


Pro Lys Gly 


He 


Lys 


Val 


Phe 


Phe 
340 


Thr 


Asp Tyr Gly 


Gin 
345 


Asp 


Met 


Asp 
355 


Gly 


Gin 


Asn Arg Thr 
360 


Lys 


Phe 


Pro 
370 


His 


Gly 


lie 


Thr Leu Asp 
375 


Leu 


Ala 


Asp 


Ala 


Tyr 


Leu 


Asp Tyr lie 


Glu 


385 










390 




Glv 


Arq 


Gin 


Thr 


He 
405 


He Gin Gly 


He 


Leu 


Thr 


Val 


Phe 
420 


Glu 


Asn Tyr Leu 


Tyr 
425 


Asn 


Thr 


Gin 
435 


Gin 


Lys 


Thr Ser Val 
440 


He 


Thr 


Glu 
450 


Tyr 


Gin 


Val 


Val Thr Arg 
455 


Val 
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15 




Lys Thr 


Cys 


Ser 


Pro 


Lys 


Gin 








30 






lie Ser 


Lys Gly 


Trp 

- XT 


Arq 


Cys 






45 








Ser Asp 


Glu Ala 


Pro 


Glu 


lie 




60 










Pro Pro 


Asn 


Glu 


His 


Ser 


Cys 


75 










80 


Ser Arg 


Leu Cys 


Asn 


Gly 


lie 


90 








95 




Gly Ala 


His 


Cys 


Arq 


Glu 


Leu 








110 






Gin His 


His 


Cys 


Val 


Pro 


Thr 






125 








Ser Ser 


Phe 


Gin 


Leu 


Glu 


Ala 




140 










Glu Cys 


Ser Val 


Tyr 


Gly 


Thr 


155 










160 


Gly Ser 


Phe 


Thr 


Cys 

2 


Gly 


Cys 


170 








175 




Asn Arg 


Ser Cys 


Lys 


Ala 


LVS 








190 






Leu Leu 


He 


Ala 


Asn 


Ser 


Gin 






205 








Ala Gin 


Val 


Ser 


Thr 


He 


Thr 




220 










Met Asp 


Phe 


Ser 


Tyr 


Ala 


Asn 


235 










240 


Asp Ser 


Ala 


Ala 


Gin 


Thr 


Gin 


250 








255 




Lys Gly 


Phe 


Val 


Asp 


Glu 


His 








270 






His Val 


Glu 


Gin 


Met 


Ala 


lie 






285 








Val Asp 


Asp 


He 


Asp 


ASD 


Arq 




300 










Thr Cys 


Val 


Thr 


Leu 


Leu 


Asp 


315 










320 


Ala Leu 


Asp 


Pro 


Ala 


Met 


Gly 


330 








335 




lie Pro 


Lys Val 


Glu 


Arq 


Cys 

2 








350 






Leu Val 


Asp 


Ser 


Lys 


He 


Val 






365 








Val Ser 


Arg Leu 


Val 


Tyr 


Trp 




380 










Val Val 


Asp 


Tyr 


Glu 


Glv 


Lys 


395 










400 


Leu He 


Glu 


His 


Leu 


Tyr 


Gly 


410 








415 




Ala Thr 


Asn 


Ser 


Asp 


Asn 


Ala 








430 






Arg Val 


Asn Arg 


Phe 


Asn 


Ser 






445 








Asp Lys 


Gly Gly 


Ala 


Leu 


His 



460 
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lie Tyr 


His 


Gin 


465 






Asn Asp 


Gin 


Tyr 


Ala Asn 


Ser 


His 






500 


Leu Gly 


Ser 


Asp 




515 




Leu Val 


Tyr 


Gly 


530 






Gly Ala 


Lys 


Val 


545 






Asn Pro 


Arg 


Ala 


Ala Asp 


Thr 


Thr 






580 


Glu Arg 


Glu 


Thr 




595 




Ala Val 


Asp 


Trp 


610 






liVS Lvs 


Thr 


lie 


625 






Lys Thr 


Leu 


lie 


Asp Pro 


Leu 


Asn 






660 


Lys Asp 


Ser 


Arg 




675 




His Arg 


ASP 


lie 


690 






Leu Ser 


Leu 


Asp 


705 






Tyr Asd 


Arg 


He 


Val Tyr 


Glu 


Gly 






740 


Gly Asn 


Tyr 


Leu 




755 




Leu Glu 


Arg 


Gly 


770 






Ser Glu 


Arq 


Pro 


785 






Gin Gin 


Val 


Gly 


Ser Leu 


CVS 


Leu 






820 


Asp Gin 


Val 


Leu 




835 




Tyr Val 


Pro 


Pro 


850 






Asn Arg 


Cvs 


He 


865 






Leu Asp 


Asn 


Ser 


Pro Ser 


Asp 


Arg 






900 


Trp Leu 


Cys 


Asp 




915 





Arq 


Arq 


Gin 


Pro 




470 






Gly 

-* 


Lys 


Pro 


Gly 


485 








Lys 


Ala 


Arg 


Thr 


Gly 


Lys 


Ser 


Cys 








520 


Lys 


Gly 


Arg 


Pro 






535 




Pro 


Asd 


Glu 


His 




550 






Leu 


Asp 


Phe 


His 


565 








Ser 


Tvr 


Leu 


He 


He 


Leu 


Lvs 


Asd 








600 


Met 


Gly 


Asp 


Asn 






615 




Ser 


Val 


Ala 


Ara 




630 






Glu 


Glv 


Lys 


Met 


645 








GlV 


Tro 


Met 


Ay a. 


Arq 


Glv 


Arq 


Leu 








680 


Phe 


Val 


Thr 


Ser 






695 




He 


Pro 


Ala 


Glv 




710 






Glu 


Thr 


He 


Leu 


725 








Pro 


Glu 


Leu 


Asn 


Phe 


Trp 


Thr 


Glu 








760 


Val 


Ala 


Gly 


Ala 






775 




Pro 


He 


Phe 


Glu 




790 






Thr 


Asn 


Lys 


Cvs 


805 








Ala 


Thr 


Pro 


Glv 


Asp 


Thr 


Asn 


Glv 








840 


Pro 


Gin 


Cys 


Gin 






855 




Gin 


Glu 


Arg 


Trp 




870 






Asp 


Glu 


Ala 


Pro 


885 








Phe 


Lys 


Cys 


Glu 


Gly 


Asp 


Asn 


Asp 



920 



Arg Val Arg Ser 
475 

Gly Cys Ser Asp 
490 

Cys Arg Cys Arg 
505 

Lys Lys Pro Glu 

Gly He He Arg 
540 

Met He Pro lie 
555 

Ala Glu Thr Gly 
570 

Gly Arg Gin Lys 
585 

Gly He His Asn 

Leu Tyr Trp Thr 
620 

Leu Glu Lys Ala 
635 

Thr His Pro Arg 
650 

Trp Thr Asp Trp 
665 

Glu Arg Ala Trp 

Lys Thr Val Leu 
700 

Arg Leu Tyr Trp 
715 

Leu Asn Gly Thr 
730 

His Ala Phe Gly 
745 

Tyr Arg Ser Gly 

Pro Pro Thr Val 
780 

He Arg Met Tyr 
795 

Arg Val Asn Asn 
810 

Ser Arg Gin Cys 
825 

Val Thr Cys *Leu 

Pro Gly Gin Phe 
860 

Lys Cys Asp Gly 
875 

Ala Leu Cys His 
890 

Asn Asn Arg Cys 
905 

Cys Gly Asn Ser 



His 


Ala 


Cys 


Glu 








480 


He 


Cys 


Leu 


Leu 






495 




Ser 


Gly 


Phe 


Ser 




510 






His 


Glu 


Leu 


Phe 


525 








Gly 


Met 


Asp 


Met 


Glu 


Asn 


Leu 


Met 








560 


Phe 


lie 


Tyr 


Phe 






575 




He 


Asp 


Gly 


Thr 




590 






Val 


Glu 


Gly 


Val 


605 








Asp 


Asp 


Gly 


Pro 


Ala 


Gin 


Thr 


Arq 








640 


Ala 


He 


Val 


Val 






655 




Glu 


Glu 


Asp 


Pro 




670 






Met 


Asp 


Gly 


Ser 


685 








Trp 


Pro 


Asn 


Gly 


Val 


Asp 


Ala 


Phe 








720 


Asp 


Arg 


Lys 


lie 






735 




Leu 


Cys 


His 


His 




750 






Ser 


Val 


Tvr 


Arq 


765 








Thr 


Leu 


Leu 


Arq 


Asp 


Ala 


His 


Glu 








800 


Gly 


Gly 


Cys 


Ser 






815 




Ala 


CVS 


Ala 


Glu 




830 






Ala 


Asn 


Pro 


Ser 


845 








Ala 


Cvs 


Ala 


Asn 


Asp 


Asn 


Asp 


Cys 








880 


Gin 


His 


Thr 


Cys 






895 




He 


Pro 


Asn 


Arg 




910 






Glu 


Asp 


Glu 


Ser 



925 
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Abii Ala Thr Cys Ser Ala Arg Thr Cys Pro Pro Asn Gin Phe Ser Cys 

930 935 940 

Ala Ser Gly Arg Cys lie Pro lie Ser Trp Thr Cys Asp Leu Asp Asp 
945 950 955 960 

Asp Cys Gly Asp Arg Ser Asp Glu Ser Ala Ser Cys Ala Tyr Pro Thr 

965 970 975 

Cys Phe Pro Leu Thr Gin Phe Thr Cys Asn Asn Gly Arg Cys lie Asn 

980 985 990 

He Asn Trp Arg Cys Asp Asn Asp Asn Asp Cys Gly Asp Asn Ser Asp 

995 1000 .1005 

Glu Ala Gly Cys Ser His Ser Cys Ser Ser Thr Gin Phe Lys Cys Asn 

1010 1015 . 1020 

Ser Gly Arg Cys He Pro Glu His Trp Thr Cys Asp Gly Asp Asn Asp 
1025 1030 1035 1040 

Cys Gly Asp Tyr Ser Asp Glu Thr His Ala Asn Cys Thr Asn Gin Ala 

1045 1050 1055 

Thr Arg Pro Pro Gly Gly Cys His Ser Asp Glu Phe Gin Cys Pro Leu 

1060 1065 1070 

Asp Gly Leu Cys He Pro Leu Arg Trp Arg Cys Asp Gly Asp Thr Asp 

1075 1080 1085 

Cys Met Asp Ser Ser Asp Glu Lys Ser Cys Glu Gly Val Thr His Val 

1090 1095 1100 

Cys Asp Pro Asn Val .Lys Phe Gly Cys Lys Asp Ser Ala Arg Cys He 
1105 1110 1115 1120 

Ser Lys Ala Trp Val Cys Asp Gly Asp Ser Asp Cys Glu Asp Asn Ser 

1125 1130 1135 

Asp Glu Glu Asn Cys Glu Ala Leu Ala Cys Arg Pro Pro Ser His Pro 

1140 1145 1150 

Cys Ala Asn Asn Thr Ser Val Cys Leu Pro Pro Asp Lys Leu Cys Asp 

1155 1160 1165 

Gly Lys Asp Asp Cys Gly Asp Gly Ser Asp Glu Gly Glu Leu Cys Asp 

1170 1175 1180 

Gin Cys Ser Leu Asn Asn Gly Gly Cys Ser His Asn Cys Ser Val Ala 
1185 1190 1195 1200 

Pro Gly Glu Gly He Val Cys Ser Cys Pro Leu Gly Met Glu Leu Gly 

1205 1210 1215 

Ser Asp Asn His Thr Cys Gin He Gin Ser Tyr Cys Ala Lys His Leu 

1220 1225 1230 

Lys Cys Ser Gin Lys Cys Asp Gin Asn Lys Phe Ser Val Lys Cys Ser 

1235 1240 1245 

Cys Tyr Glu Gly Trp Val Leu Glu Pro Asp Gly Glu Thr Cys Arg Ser 

1250 1255 1260 

Leu Asp Pro Phe Lys Leu Phe He lie Phe Ser Asn Arg His Glu He 
1265 1270 1275 ~ 1280 

Arg Arg He Asp Leu His Lys Gly Asp Tyr Ser Val Leu Val Pro Gly 

1285 1290 1295 

Leu Arg Asn Thr He Ala Leu Asp Phe His Leu Ser Gin Ser Ala Leu 

1300 1305 1310 

Tyr Trp Thr Asp Ala Val Glu Asp Lys lie Tyr Arg Gly Lys Leu Leu 

1315 1320 1325 

Asp Asn Gly Ala Leu Thr Ser Phe Glu Val Val lie Gin Tyr Gly Leu 

1330 1335 1340 

Ala Thr Pro Glu Gly Leu Ala Val Asp Trp He Ala Gly Asn He Tyr 
1345 1350 * 1355 1360 

Trp Val Glu Ser Asn Leu Asp Gin He Glu Val Ala Lys Leu Asp Gly 

1365 1370 1375 

Thr Leu Arg Thr Thr Leu Leu Ala Gly Asp He Glu His Pro Arg Ala 
1380 1385 1390 
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lie Ala lieu Asp Pro Arg Asp Gly lie Leu Phe Trp Thr Asp Trp Asp 

1395 1400 1405 

Ala Ser Leu Pro Arg lie Glu Ala Ala Ser Met Ser Gly Ala Gly Arg 

1410 1415 1420 

Arg Thr lie His Arg Glu Thr Gly Ser Gly Gly Cys Ala Asn Gly Leu 
1425 1430 1435 1440 

Thr Val Asp Tyr Leu Glu Lys Arg lie Leu Trp lie Asp Ala Arg Ser 

1445 1450 1455 

Asp Ala lie Tyr Ser Ala Arg Tyr Asp Gly Ser Gly His Met Glu Val 

1460 1465 1470 

Leu Arg Gly His Glu Phe Leu Ser His Pro Phe Ala Val Thr Leu Tyr 

1475 1480 1485 

Gly Gly Glu Val Tyr Trp Thr Asp Trp Arg Thr Asn Thr Leu Ala Lys 

1490 1495 1500 

Ala Asn Lys Trp Thr Gly His Asn Val Thr Val Val Gin Arg Thr Asn 
1505 1510 1515 1520 

Thr Gin Pro Phe Asp Leu Gin Val Tyr His Pro Ser Arg Gin Pro Met 

1525 1530 1535 

Ala Pro Asn Pro Cys Glu Ala Asn Gly Gly Arg Gly Pro Cys Ser His 

1540 1545 1550 

Leu Cys Leu He Asn Tyr Asn Arg Thr Val Ser Trp Ala Cys Pro His 

1555 1560 1565 

Leu Met Lys Leu His Lys Asp Asn Thr Thr Cys Tyr Glu Phe Lys Lys 

1570 1575 1580 

Phe Leu Leu Tyr Ala Arg Gin Met Glu He Arg Gly Val Asp Leu Asp 
1585 1590 1595 1600 

Ala Pro Tyr Tyr Asn Tyr He He Ser Phe Thr Val Pro Asp He Asp 

1605 1610 1615 

Asn Val Thr Val Leu Asp Tyr Asp Ala Arg Glu Gin Arg Val Tyr Trp 

1620 1625 1630 

Ser Asp Val Arg Thr Gin Ala He Lys Arg Ala Phe He Asn Gly Thr 

1635 1640 1645 

Gly Val Glu Thr Val Val Ser Ala Asp Leu Pro Asn Ala His Gly Leu 

1650 1655 1660 

Ala Val Asp Trp Val Ser Arg Asn Leu Phe Trp Thr Ser Tyr Asp Thr 
1665 1670 1675 ~ 1680 

Asn Lys Lys Glii He Asn Val Ala Arg Leu Asp Gly Ser Phe Lys Asn 

1685 1690 1695 

Ala Val Val Gin Gly Leu Glu Gin Pro His Gly Leu Val Val His Pro 

1700 1705 1710 

Leu Arg Gly Lys Leu Tyr Trp Thr Asp Gly Asp Asn He Ser Met Ala 

1715 1720 1725 

Asn Met Asp Gly Ser Asn His Thr Leu Leu Phe Ser Gly Gin Lys Gly 

1730 1735 1740 

Pro Val Gly Leu Ala lie Asp Phe Pro Glu Ser Lys Leu Tyr Trp He 
1745 1750 1755 1760 

Ser Ser Gly Asn His Thr He Asn Arg Cys Asn Leu Asp Gly Ser Glu 

1765 1770 1775 

Leu Glu Val lie Asp Thr Met Arg Ser Gin Leu Gly Lys Ala Thr Ala 

1780 1785 1790 

Leu Ala He Met Gly Asp Lys Leu Trp Trp Ala Asp Gin Val Ser Glu 

1795 1800 1805 

Lys Met Gly Thr Cys Asn Lys Ala Asp Gly Ser Gly Ser Val Val Leu 

1810 1815 1820 

Arg Asn Ser Thr Thr Leu Val Met His Met Lys Val Tyr Asp Glu Ser 
1825 1830 1835 1840 

He Gin Leu Glu His Glu Gly Thr Asn Pro Cys Ser Val Asn Asn Gly 
1845 1850 1855 



9 



WO 01/92474 PCT/US01/18041 



Asp Cys Ser Gin Leu Cys Leu Pro Thr Ser Glu Thr Thr Arg Ser Cys 

1860 1865 1870 

Met Cys Thr Ala Gly Tyr Ser Leu Arg Ser Gly Gin Gin Ala Cys Glu 

1875 1880 1885 

Gly Val Gly Ser Phe Leu Leu Tyr Ser Val His Glu Gly lie Arg Gly 

1890 1895 1900 

lie Pro Leu Asp Pro Asn Asp Lys Ser Asp Ala Leu Val Pro Val Ser 
1905 1910 1915 1920 

Gly Thr Ser Leu Ala Val Gly lie Asp Phe His Ala Glu Asn Asp Thr 

1925 1930 1935 

lie Tyr Trp Val Asp Met Gly Leu Ser Thr lie Ser Arg Ala Lys Arg 

1940 1945 1950 

Asp Gin Thr Trp Arg Glu Asp Val Val Thr Asn Gly lie Gly Arg Val 

1955 1960 1965 

Glu Gly lie Ala Val Asp Trp lie Ala Gly Asn lie Tyr Trp Thr Asp 

1970 1975 1980 

Gin Gly Phe Asp Val lie Glu Val Ala Arg Leu Asn Gly Ser Phe Arg 
1985 1990 1995 2000 

Tyr Val Val lie Ser Gin Gly Leu Asp Lys Pro Arg Ala He Thr Val 

2005 2010 2015 

His Pro Glu Lys Gly Tyr Leu Phe Trp Thr Glu Trp Gly His Tyr Pro 

2020 2025 2030 

Arg He Glu Arg Ser Arg Leu Asp Gly Thr Glu Arg Val Val Leu Val 

2035 2040 2045 

Asn Val Ser lie Ser Trp Pro Asn Gly He Ser Val Asp Tyr Gin Gly 

2050 2055 2060 

Gly Lys Leu Tyr Trp Cys Asp Ala Arg Met Asp Lys He Glu Arg He 
2065 2070 2075 2080 

Asp Leu Glu Thr Gly Glu Asn Arg Glu Val Val Leu Ser Ser Asn Asn 

2085 2090 2095 

Met Asp Met Phe Ser Val Ser Val Phe Glu Asp Phe He Tyr Trp Ser 

2100 2105 2110 

Asp Arg Thr His Ala Asn Gly Ser He Lys Arg Gly Cys Lys Asp Asn 

2115 2120 2125 

Ala Thr Asp Ser Val Pro Leu Arg Thr Gly He Gly Val Gin Leu Lys 

2130 2135 2140 

Asp He Lys Val Phe Asn Arg Asp Arg Gin Lys Gly Thr Asn Val Cys 
2145 2150 2155 2160 

Ala Val Ala Asn Gly Gly Cys Gin Gin Leu Cys Leu Tyr Arg Gly Gly 

2165 2170 * 2175 

Gly Gin Arg Ala Cys Ala Cys Ala His Gly Met Leu Ala Glu Asp Gly 

2180 2185 2190 

Ala Ser Cys Arg Glu Tyr Ala Gly Tyr Leu Leu Tyr Ser Glu Arg Thr 

2195 2200 2205 

He Leu Lys Ser He His Leu Ser Asp Glu Arg Asn Leu Asn Ala Pro 

2210 2215 2220 

Val Gin Pro Phe Glu Asp Pro Glu His Met Lys Asn Val He Ala Leu 
2225 2230 2235 2240 

Ala Phe Asp Tyr Arg Ala Gly Thr Ser Pro Gly Thr Pro Asn Arg He 

2245 2250 2255 

Phe Phe Ser Asp He His Phe Gly Asn He Glri Gin He Asn Asp Asp 

2260 2265 2270 

Gly Ser Gly Arg Thr Thr He Val Glu Asn Val Gly Ser Val Glu Gly 

2275 2280 2285 

Leu Ala Tyr His Arg Gly Trp Asp Thr Leu Tyr Trp Thr Ser Tyr Thr 

2290 2295 2300 

Thr Ser Thr He Thr Arg His Thr Val Asp Gin Thr Arg Pro Gly Ala 
2305 2310 2315 ~ 2320 
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Phe Glu Arg Glu Thr Val lie Thr Met Ser Gly Asp Asp His Pro Arg 

2325 2330 ~ 2335 

Ala Phe Val Leu Asp Glu Cys Gin Asn Leu Met Phe Trp Thr Asn Trp 

2340 2345 2350 

Asn Glu Leu His Pro Ser lie Met Arg Ala Ala Leu Ser Gly Ala Asn 

2355 2360 2365 

Val Leu Thr Leu lie Glu Lys Asp lie Arg Thr Pro Asn Gly Leu Ala 

2370 2375 2380 

lie Asp His Arg Ala Glu Lys Leu Tyr Phe Ser Asp Ala Thr Leu Asp 
2385 2390 2395 2400 

Lys lie Glu Arg Cys Glu Tyr Asp Gly Ser His Arg Tyr Val He Leu 

2405 2410 2415 

Lys Ser Glu Pro Val His Pro Phe Gly Leu Ala Val Tyr Gly Glu His 

2420 2425 2430 

He Phe Trp Thr Asp Trp Val Arg Arg Ala Val Gin Arg Ala Asn Lys 

2435 2440 2445 

Tyr Val Gly Ser Asp Met Lys Leu Leu Arg Val Asp lie Pro Gin Gin 

2450 2455 2460 

Pro Met Gly lie lie Ala Val Ala Asn Asp Thr Asn Ser Cys Glu Leu 
2465 2470 ~ 2475 2480 

Ser Pro Cys Arg He Asn Asn Gly Gly Cys Gin Asp Leu Cys Leu Leu 

2485 2490 2495 

Thr His Gin Gly His Val Asn Cys Ser Cys Arg Gly Gly Arg He Leu 

2500 2505 2510 

Gin Glu Asp Phe Thr Cys Arg Ala Val Asn Ser Ser Cys Arg Ala Gin 

2515 2520 2525 

Asp Glu Phe Glu Cys Ala Asn Gly Glu Cys He Ser Phe Ser Leu Thr 

2530 2535 2540 

Cys Asp Gly Val Ser His Cys Lys Asp Lys Ser Asp Glu Lys Pro Ser 
2545 2550 2555 2560 

Tyr Cys Asn Ser Arg Arg Cys Lys Lys Thr Phe Arg Gin Cys Asn Asn 

2565 2570 2575 

Gly Arg Cys Val Ser Asn Met Leu Trp Cys Asn Gly Val Asp Tyr Cys 

2580 2585 2590 

Gly Asp Gly Ser Asp Glu He Pro Cys Asn Lys Thr Ala Cys Gly Val 

2595 2600 2605 

Gly Glu Phe Arg Cys Arg Asp Gly Ser Cys lie Gly Asn Ser Ser Arg 

2610 2615 2620 

Cys Asn Gin Phe Val Asp Cys Glu Asp Ala Ser Asp Glu Met Asn Cys 
2625 2630 2635 2640 

Ser Ala Thr Asp Cys Ser Ser Tyr Phe Arg Leu Gly Val Lys Gly Val 

2645 2650 2655 

Leu Phe Gin Pro Cys Glu Arg Thr Ser Leu Cys Tyr Ala Pro Ser Trp 

2660 2665 2670 

Val Cys Asp Gly Ala Asn Asp Cys Gly Asp Tyr Ser Asp Glu Arg Asp 

2675 2680 2685 

Cys Pro Gly Val Lys Arg Pro Arg Cys Pro Leu Asn Tyr Phe Ala Cys 

2690 2695 2700 

Pro Ser Gly Arg Cys He Pro Met Ser Trp Thr Cys Asp Lys Glu Asp 
2705 2710 2715 2720 

Asp Cys Glu Asn Gly Glu Asp Glu Thr His Cys Asn Lys Phe Cys Ser 

2725 2730 2735 

Glu Ala Gin Phe Glu Cys Gin Asn His Arg Cys He Ser Lys Gin Trp 

2740 2745 2750 

Leu Cys Asp Gly Ser Asp Asp Cys Gly Asp Gly Ser Asp Glu Ala Ala 

2755 2760 2765 

His Cys Glu Gly Lys Thr Cys Gly Pro Ser Ser Phe Ser Cys Pro Gly 
2770 2775 2780 
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Thr His Val Cys Val Pro Glu Arg Trp Leu Cys Asp Gly Asp Lys Asp 
2785 2790 2795 2800 

Gys Thr Asp Gly Ala Asp Glu Ser Val Thr Ala Gly Cys Leu Tyr Asn 

2805 2810 2815 

Ser Thr Cys Asp Asp Arg Glu Phe Met Cys Gin Asn Arg Leu Cys He 

2820 2825 2830 

Pro Lys His Phe Val Cys Asp His Asp Arg Asp Cys Ala Asp Gly Ser 

2835 2840 ~ 2845 

Asp Glu Ser Pro Glu Cys Glu Tyr Pro Thr Cys Gly Pro Asn Glu Phe 

2850 2855 2860 

Arg Cys Ala Asn Gly Arg Cys Leu Ser Ser Arg Gin Trp Glu Cys Asp 
2865 2870 2875 2880 

Gly Glu Asn Asp Cys His Asp His Ser Asp Glu Ala Pro Lys Asn Pro 

2885 2890 2895 

His Cys Thr Ser Pro Glu His Lys Cys Asn Ala Ser Ser Gin Phe Leu 

2900 2905 2910 

Cys Ser Ser Gly Arg Cys Val Ala Glu Ala Leu Leu Cys Ash Gly Gin 

2915 2920 2925 

Asp Asp Cys Gly Asp Gly Ser Asp Glu Arg Gly Cys His Val Asn Glu 

2930 2935 2940 

Cys Leu Ser Arg Lys Leu Ser Gly Cys Ser Gin Asp Cys Glu Asp Leu 
2945 2950 2955 2960 

Lys He Gly Phe Lys Cys Arg Cys Arg Pro Gly Phe Arg Leu Lys Asp 

2965 2970 2975 

Asp Gly Arg Thr Cys Ala Asp Leu Asp Glu Cys Ser Thr Thr Phe Pro 

2980 2985 2990 

Cys Ser Gin Leu Cys He Asn Thr His Gly Ser Tyr Lys Cys Leu Cys 

2995 3000 3005 

Val Glu Gly Tyr Ala Pro Arg Gly Gly Asp Pro His Ser Cys Lys Ala 

3010 3015 3020 

Val Thr Asp Glu Glu Pro Phe Leu He Phe Ala Asn Arg Tyr Tyr Leu 
3025 3030 3035 3040 

Arg Lys Leu Asn Leu Asp Gly Ser Asn Tyr Thr Leu Leu Lys Gin Gly 

3045 3050 3055 

Leu Asn Asn Ala Val Ala Leu Ala Phe Asp Tyr Arg Glu Gin Met lie 

3060 3065 3070 

Tyr Trp Thr Gly Val Thr Thr Gin Gly Ser Met He Arg Arg Met His 

3075 3080 3085 

Leu Asn Gly Ser Asn Val Gin Val Leu His Arg Thr Gly Leu Ser Asn 

3090 3095 3100 

Pro Asp Gly Leu Ala Val Asp Trp Val Gly Gly Asn Leu Tyr Trp Cys 
3105 3110 3115 3120 

Asp Lys Gly Arg Asp Thr He Glu Val Ser Lys Leu Asn Gly Ala Tyr 

3125 3130 3135 

Arg Thr Val Leu Val Ser Ser Gly Leu Arg Glu Pro Arg Ala Leu Val 

3140 3145 3150 

Val Asp Val Gin Asn Gly Tyr Leu Tyr Trp Thr Asp Trp Gly Asp His 

3155 3160 3165 

Ser Leu He Gly Arg He Gly Met Asp Gly Ser Gly Arg Ser lie He 

3170 3175 3180 

Val Asp Thr Lys He Thr Trp Pro Asn Gly Leu Thr Val Asp Tyr Val 
3185 3190 3195 3200 

Thr Glu Arg He Tyr Trp Ala Asp Ala Arg Glu Asp Tyr He Glu Phe 

3205 3210 3215 

Ala Ser Leu Asp Gly Ser Asn Arg His Val Val Leu Ser Gin Asp He 

3220 3225 3230 

Pro His He Phe Ala Leu Thr Leu Phe Glu Asp Tyr Val Tyr Trp Thr 
3235 3240 3245 
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Asp Trp Glu Thr Lys Ser lie Asn Arg Ala His Lys Thr Thr Gly Ala 

3250 3255 3260 

Asn Lys Thr Leu Leu He Ser Thr Leu His Arg Pro Met Asp Leu His 
3265 3270 3275 3280 

Val Phe His Ala Leu Arg Gin Pro Asp Val Pro Asn His Pro Cys Lys 

3285 3290 3295 

Val Asn Asn Gly Gly Cys Ser Asn Leu Cys Leu Leu Ser Pro Gly Gly 

3300 3305 3310 

Gly His Lys Cys Ala Cys Pro Thr Asn Phe Tyr Leu Gly Gly Asp Gly 

3315 3320 3325 

Arg Thr Cys Val Ser Asn Cys Thr Ala Ser Gin Phe Val Cys Lys Asn 

3330 3335 3340 

Asp Lys Cys He Pro Phe Trp Trp Lys Cys Asp Thr Glu Asp Asp Cys 
3345 3350 3355 3360 

Gly Asp His Ser Asp Glu Pro Pro Asp Cys Pro Glu Phe Lys Cys Arg 

3365 3370 3375 

Pro Gly Gin Phe Gin Cys Ser Thr Gly He Cys Thr Asn Pro Ala Phe 

3380 3385 3390 

He Cys Asp Gly Asp Asn Asp Cys Gin Asp Asn Ser Asp Glu Ala Asn 

3395 3400 3405 

Cys Asp He His Val Cys Leu Pro Ser Gin Phe Lys Cys Thr Asn Thr 

3410 3415 3420 

Asn Arg Cys He Pro Gly He Phe Arg Cys Asn Gly Gin Asp Asn Cys 
3425 3430 3435 3440 

Gly Asp Gly Glu Asp Glu Arg Asp Cys Pro Glu Val Thr Cys Ala Pro 

3445 3450 3455 

Asn Gin Phe Gin Cys Ser lie Thr Lys Arg Cys He Pro Arg Val Trp 

3460 3465 3470 

Val Cys Asp Arg Asp Asn His Cys Val Asp Gly Ser Asp Glu Pro Ala 

3475 3480 3485 

Asn Cys Thr Gin Met Thr Cys Gly Val Asp Glu Phe Arg Cys Lys Asp 

3490 3495 3500 

Ser Gly Arg Cys He Pro Ala Arg Trp Lys Cys Asp Gly Glu Asp Asp 
3505 3510 3515 3520 

Cys Gly Asp Gly Ser Asp Glu Pro Lys Glu Glu Cys Asp Glu Arg Thr 

3525 3530 3535 

Cys Glu Pro Tyr Gin Phe Arg Cys Lys Asn Asn Arg Cys Val Pro Gly 

3540 3545 3550 

Arg Trp Gin Cys Asp Tyr Asp Asn Asp Cys Gly Asp Asn Ser Asp Glu 

3555 3560 3565 

Glu Ser Cys Thr Pro Arg Pro Cys Ser Glu Ser Glu Phe Phe Cys Ala 

3570 3575 3580 

Asn Gly Arg Cys lie Ala Gly Arg Trp Lys Cys Asp Gly Asp His Asp 
3585 3590 3595 3600 

Cys Ala Asp Gly Ser Asp Glu Lys Asp Cys Thr Pro Arg Cys Asp Met 

3605 3610 3615 

Asp Gin Phe Gin Cys Lys Ser Gly His Cys He Pro Leu Arg Trp Pro 

3620 3625 3630 

Cys Asp Ala Asp Ala Asp Cys Met Asp Gly Ser Asp Glu Glu Ala Cys 

3635 3640 3645 

Gly Thr Gly Val Arg Thr Cys Pro Leu Asp Glu Phe Gin Cys Asn Asn 

3650 3655 3660 

Thr Leu Cys Lys Pro Leu Ala Trp Lys Cys Asp Gly Glu Asp Asp Cys 
3665 3670 3675 *" 3680 

Gly Asp Asn Ser Asp Glu Asn Pro Glu Glu Cys Ala Arg Phe He Cys 

3685 3690 3695 

Pro Pro Asn Arg Pro Phe Arg Cys Lys Asn Asp Arg Val Cys Leu Trp 
3700 3705 3710 
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lie Gly Arg Gin Cys Asp Gly Val Asp Asrt Cys Gly Asp Gly Thr Asp 

3715 3720 3725 

Glu Glu Asp Cys Glu Pro Pro Thr Ala Gin Asn Pro His Cys Lys Asp 

3730 3735 3740 

Lys Lys Glu Phe Leu Cys Arg Asn Gin Arg Cys Leu Ser Ser Ser Leu 
3745 3750 3755 3760 

Arg Cys Asn Met Phe Asp Asp Cys Gly Asp Gly Ser Asp Glu Glu Asp 

3765 3770 3775 

Cys Ser lie Asp Pro Lys Leu Thr Ser Cys Ala Thr Asn Ala Ser Met 

3780 3785 3790 

Cys Gly Asp Glu Ala Arg Cys Val Arg Thr Glu Lys Ala Ala Tyr Cys 

3795 3800 3805 

Ala Cys Arg Ser Gly Phe His Thr Val Pro Gly Gin Pro Gly Cys Gin. 

3810 3815 * 3820 

Asp He Asn Glu Cys Leu Arg Phe Gly Thr Cys Ser Gin Leu Trp Asn 
3825 3830 3835 ' 3840 

Lys Pro Lys Gly Gly His Leu Cys Ser Cys Ala Arg Asn Phe Met Lys 

3845 3850 3855 

Thr His Asn Thr Cys Lys Ala Glu Gly Ser Glu Tyr Gin Val Leu Tyr 

3860 3865 3870 

He Ala Asp Asp Asn Glu He Arg Ser Leu Phe Pro Gly His Pro His 

3875 3880 3885 

Ser Ala Tyr Glu Gin Thr Phe Gin Gly Asp Glu Ser Val Arg He Asp 

3890 3895 3900 

Ala Met Asp Val His Val Lys Ala Gly Arg Val Tyr Trp Thr Asn Trp 
3905 3910 3915 3920 

His Thr Gly Thr He Ser Tyr Arg Ser Leu Pro Pro Ala Ala Pro Pro 

3925 3930 3935 

Thr thr Ser Asn Arg His Arg Arg Gin He Asp Arg Gly Val Thr His 

3940 3945 3950 

Leu Asn He Ser Gly Leu Lys Met Pro Arg Gly lie Ala He Asp Trp 

3955 3960 3965 

Val Ala Gly Asn Val Tyr Trp Thr Asp Ser Gly Arg Asp Val He Glu 

3970 3975 3980 

Val Ala Gin Met Lys Gly Glu Asn Arg Lys Thr Leu He Ser Gly Met 
3985 3990 3995 4000 

He Asp Glu Pro His Ala He Val Val Asp Pro Leu Arg Gly Thr Met 

4005 4010 4015 

Tyr Trp Ser Asp Trp Gly Asn His Pro Lys lie Glu Thr Ala Ala Met 

4020 4025 4030 

Asp Gly Thr Leu Arg Glu Thr Leu Val Gin Asp Asn He Gin Trp Pro 

4035 4040 4045 

Thr Gly Leu Ala Val Asp Tyr His Asn Glu Arg Leu Tyr Trp Ala Asp 

4050 4055 4060 

Ala Lys Leu Ser Val He Gly Ser He Arg Leu Asn Gly Thr Asp Pro 
4065 4070 4075 4080 

He Val Ala Ala Asp Ser Lys Arg Gly Leu Ser His Pro Phe Ser He 

4085 4090 4095 

Asp Val Phe Glu Asp Tyr He Tyr Gly Val Thr Tyr lie Asn Asn. Arg 

4100 4105 4110 

Val Phe Lys lie His Lys Phe Gly His Ser Pro Leu Tyr Asn Leu Thr 

4115 4120 4125 

Gly Gly LeU Ser His Ala Ser Asp Val Val Leu Tyr His Gin His Lys 

4130 4135 4140 

Gin Pro Glu Val Thr Asn Pro Cys Asp Arg Lys Lys Cys Glu Trp Leu 
4145 4150 4155 4160 

Cys Leu Leu Ser Pro Ser Gly Pro Val Cys Thr Cys Pro Asn Gly Lys 



4165 



4170 



4175 
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Arg Leu Asp Asn Gly Thr Cys Val Pro Val Pro Ser Pro Thr Pro Pro 

4180 4185 4190 

Pro Asp Ala Pro Arg Pro Gly Thr Cys Thr Leu Gin Cys Phe Asn Gly 

4195 4200 4205 

Gly Ser Cys Phe Leu Asn Ala Arg Arg Gin Pro Lys Cys Arg Cys Gin 

4210 4215 4220 

Pro Arg Tyr Thr Gly Asp Lys Cys Glu Leu Asp Gin Cys Trp Glu Tyr 
4225 4230 4235 4240 

Cys His Asn Gly Gly Thr Cys Ala Ala Ser Pro Ser Gly Met Pro Thr 

4245 4250 4255 

Cys Arg Cys Pro Thr Gly Phe Thr Gly Pro Lys Cys Thr Ala Gin Val 

4260 4265 4270 

Cys Ala Gly Tyr Cys Ser Asn Asn Ser Thr Cys Thr Val Asn Gin Gly 

4275 4280 4285 • 

Asn Gin Pro Gin Cys Arg Cys Leu Pro Gly Phe Leu Gly Asp Arg Cys 

4290 4295 4300 

Gin Tyr Arg Gin Cys Ser Gly Phe Cys Glu Asn Phe Gly Thr Cys Gin 
4305 4310 4315 * 4320 " 

Met Ala Ala Asp Gly Ser Arg Gin Cys Arg Cys Thr Val Tyr Phe Glu 

4325 4330 4335 

Gly Pro Arg Cys Glu Val Asn Lys Cys Ser Arg Cys Leu Gin Gly Ala 

4340 4345 4350 

Cys Val Val Asn Lys Gin Thr Gly Asp Val Thr Cys Asn Cys Thr Asp 

4355 4360 4365 

Gly Arg Val Ala Pro Ser Cys Leu Thr Cys He Asp His Cys Ser Asn 

4370 4375 4380 

Gly Gly Ser Cys Thr Met Asn Ser Lys Met Met Pro Glu Cys Gin Cys 
4385 4390 4395 4400 

Pro Pro His Met Thr Gly Pro Arg Cys Gin Glu Gin Val Val Ser Gin 

4405 4410 4415 

Gin Gin Pro Gly His Met Ala Ser He Leu He Pro Leu Leu Leu Leu 

4420 4425 4430 

Leu Leu Leu Leu Leu Val Ala Gly Val Val Phe Trp Tyr Lys Arg Arg 

4435 4440 4445 

Val Arg Gly Ala Lys Gly Phe Gin His Gin Arg Met Thr Asn Gly Ala 

4450 4455 4460 

Met Asn Val Glu He Gly Asn Pro Thr Tyr Lys Met Tyr Glu Gly Gly 
4465 4470 4475 4480 

Glu Pro Asp Asp Val Gly Gly Leu Leu Asp Ala Asp Phe Ala Leu Asp 

4485 4490 4495 

Pro Asp Lys Pro Thr Asn Phe Thr Asn Pro Val Tyr Ala Thr Leu Tyr 

4500 4505 4510 

Met Gly Gly His Gly Ser Arg His Ser Leu Ala Ser Thr Asp Glu Lys 

4515 4520 4525 

Arg Glu Leu Leu Gly Arg Gly Pro Glu Asp Glu He Gly Asp Pro Leu 
4530 4535 4540 

Ala 
4545 

<210> 3 

<211> 4577 

<212> DNA 

<213> Homo sapiens 

<400> 3 

gctacaatcc atctggtctc ctccagctcc ttctttctgc aacatgggga agaacaaact 60 

ccttcatcca agtctggttc ttctcctctt ggtcctcctg cccacagacg cctcagtctc 120 

tggaaaaccg cagtatatgg ttctggtccc ctccctgctc cacactgaga ccactgagaa 180 



15 



WO 01/92474 



PCT/US01/18041 



gggctgtgtc cttctgagct acctgaatga gacagtgact gtaagtgctt ccttggagtc 240 

tgtcagggga aacaggagcc tcttcactga cctggaggcg gagaatgacg tactccactg 300 

tgtcgccttc gctgtcccaa agtcttcatc caatgaggag gtaatgttcc tcactgtcca 360 

agtgaaagga ccaacccaag aatttaagaa gcggaccaca gtgatggtta agaacgagga 420 

cagtctggtc tttgtccaga cagacaaatc aatctacaaa ccagggcaga cagtgaaatt 480 

tcgtgttgtc tccatggatg aaaactttca ccccctgaat gagttgattc cactagtata 540 

cattcaggat cccaaaggaa atcgcatcgc acaatggcag agtttccagt tagagggtgg 600 

cctcaagcaa ttttcttttc ccctctcatc agagcccttc cagggctcct acaaggtggt 660 

ggtacagaag aaatcaggtg gaaggacaga gcaccctttc accgtggagg aatttgttct 720 

tcceaagttt gaagtacaag taacagtgcc aaagataatc accatcttgg aagaagagat 780 

gaatgtatca gtgtgtggcc tatacacata tgggaagcct gtccctggac atgtgactgt 840 

gagcatttgc agaaagtata gtgacgcttc cgactgccac ggtgaagatt cacaggcttt 900 

ctgtgagaaa ttcagtggac agctaaacag ccatggctgc ttctatcagc aagtaaaaac 960 

caaggtcttc cagctgaaga ggaaggagta tgaaatgaaa cttcacactg aggcccagat 1020 

ccaagaagaa ggaacagtgg tggaattgac tggaaggcag tccagtgaaa tcacaagaac 1080 

cataaccaaa ctctcatttg tgaaagtgga ctcacacttt cgacagggaa ttcccttctt 1140 

tgggcaggtg cgcctagtag atgggaaagg cgtccctata ccaaataaag teat at teat 1200 

cagaggaaat gaagcaaact attactccaa tgctaccacg gatgagcatg gecttgtaca 1260 

gttctctatc aacaccacca acgttatggg tacctctctt actgttaggg tcaattacaa 1320 

ggatcgtagt ccctgttacg gctaccagtg ggtgtcagaa gaacacgaag aggcacatca 1380 

cactgettat cttgtgttct ccccaagcaa gagctttgtc caccttgagc ccatgtctca 1440 

tgaactaccc tgtggccata ctcagacagt ccaggcacat tatattctga atggaggcac 1500 

cctgctgggg ctgaagaagc tctcctttta ttatctgata atggcaaagg gaggcattgt 1560 

ccgaactggg actcatggac tgcttgtgaa gcaggaagac atgaagggee atttttccat 1620 

ctcaatccct gtgaagtcag acattgctcc tgtegctegg ttgetcatet atgctgtttt 1680 

acctaccggg gacgtgattg gggattctgc aaaatatgat gttgaaaatt gtctggccaa 1740 

caaggtggat ttgagcttca gcccatcaca aagtctccca gcctcacacg cccacctgcg 1800 

agtcacagcg gctcctcagt ccgtctgcgc cctccgtgct gtggaccaaa gcgtgctgct 1860 

catgaagect gatgetgage tctcggcgtc cteggtttae aacctgctac cagaaaagga 1920 

cctcactggc ttccctgggc ctttgaatga ccaggacgat gaagactgea teaategtea 1980 

taatgtctat attaatggaa tcacatatac tccagtatca agtacaaatg aaaaggatat 2040 

gtacagcttc ctagaggaca tgggcttaaa ggcattcacc aactcaaaga ttegtaaace 2100 

caaaatgtgt ccacagcttc aacagtatga aatgcatgga cctgaaggtc tacgtgtagg 2160 

tttttatgag tcagatgtaa tgggaagagg ccatgcacgc ctggtgcatg ttgaagagee 2220 

teacaeggag acegtacgaa agtacttccc tgagacatgg atctgggatt tggtggtggt 2280 

aaactcagca ggggtggctg aggtaggagt aacagtccct gacaccatca ccgagtggaa 2340 

ggcaggggee ttctgcctgt ctgaagatgc tggacttggt atctcttcca ctgcctctct 2400 

ccgagccttc cagcccttct ttgtggagct tacaatgect tactctgtga ttcgtggaga 2460 

ggccttcaca ctcaaggcca eggtcctaaa ctaccttccc aaatgeatec gggtcagtgt 2520 

gcagctggaa gcctctcccg ccttccttgc tgtcccagtg gagaaggaac aagcgcctca 2580 

ctgcatctgt geaaaeggge ggcaaactgt gtcctgggca gtaaccccaa agtcattagg 2640 

aaatgtgaat ttcactgtga gegcagagge actagagtct caagagctgt gtgggactga 2700 

ggtgccttca gttcctgaac aeggaaggaa agacacagtc atcaagcctc tgttggttga 2760 

acctgaagga ctagagaagg aaacaacatt caactcccta ctttgtccat caggtggtga 2820 

ggtttctgaa gaattatccc tgaaactgee accaaatgtg gtagaagaat ctgcccgagc 2880 

ttctgtctca gttttgggag acatattagg ctctgccatg caaaacacac aaaatcttct 2940 

ccagatgccc tatggctgtg gagagcagaa tatggtcctc tttgetccta acatctatgt 3 000 

actggattat ctaaatgaaa cacagcagct tactccagag gtcaagtcca aggecattgg 3060 

ctatctcaac actggttacc agagacagtt gaactacaaa cactatgatg gctcctacag 3120 

cacctttggg gagegatatg gcaggaacca gggcaacacc tggctcacag cctttgttct 3180 

gaagactttt gcccaagctc gagectacat cttcatcgat gaagcacaca ttacccaagc 3240 

cctcatatgg ctctcccaga ggcagaagga caatggctgt ttcaggagct ctgggtcact 3300 

gctcaacaat gecataaagg gaggagtaga agatgaagtg accctctccg cctatatcac 3360 

catcgccctt ctggagattc ctctcacagt cactcaccct gttgtccgca atgccctgtt 3420 

ttgcctggag tcagcctgga agacagcaca agaaggggac catggcagcc atgtatatac 3480 

caaagcactg ctggcctatg cttttgccct ggcaggtaac caggacaaga ggaaggaagt 3540 

actcaagtca cttaatgagg aagctgtgaa gaaagacaac tctgtccatt gggagcgccc 3600 

tcagaaaccc aaggcaccag tggggcattt ttacgaaccc caggctccct ctgctgaggt 3660 
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ggagatgaca tcctatgtgc tcctcgctta tctcacggcc cagccagccc caacctcgga 3720 

ggacctgacc tctgcaacca acatcgtgaa gtggatcacg aagcagcaga atgcccaggg 3780 

cggtttctcc tccacccagg acacagtggt ggctctccat gctctgtcca aatatggagc 384 0 

cgccacattt accaggactg ggaaggctgc acaggtgact atccagtctt cagggacatt 3 900 

ttccagcaaa ttccaagtgg acaacaacaa tcgcctgtta ctgcagcagg tctcattgcc 3960 

agagctgcct ggggaataca gcatgaaagt gacaggagaa ggatgtgtct acctccagac 4020 

ctccttgaaa tacaatattc tcccagaaaa ggaagagttc ccctttgctt taggagtgca 4080 

gactctgcct caaacttgtg atgaacccaa agcccacacc agcttccaaa tctccctaag 4140 

tgtcagttac acagggagcc gctctgcctc caacatggcg atcgttgatg tgaagatggt 4200 

ctctggcttc attcccctga agccaacagt gaaaatgctt gaaagatcta accatgtgag 4260 

ccggacagaa gtcagcagca accatgtctt gatttacctt gataaggtgt caaatcagac 4320 

actgagcttg ttcttcacgg ttctgcaaga tgtcccagta agagatctca aaccagccat 4380 

agtgaaagtc tatgattact acgagacgga tgagtttgca atcgctgagt acaatgctcc 4440 

ttgcagcaaa gatcttggaa atgcttgaag accacaaggc tgaaaagtgc tttgctggag 4500 

tcctgttctc tgagctccac agaagacacg tgtttttgta tctttaaaga cttgatgaat 4560 

aaacactttt tctggtc 4577 

<210> 4 

<211> 4422 

<212> DNA 

<213> Homo sapiens 



<400> 4 

atggggaaga acaaactcct tcatccaagt ctggttcttc tcctcttggt cctcctgccc 60 

acagacgcct cagtctctgg aaaaccgcag tatatggttc tggtcccctc cctgctccac 120 

actgagacca ctgagaaggg ctgtgtcctt ctgagctacc tgaatgagac agtgactgta 180 

agtgcttcct tggagtctgt caggggaaac aggagcctct tcactgacct ggaggcggag 240 

aatgacgtac tccactgtgt cgccttcgct gtcccaaagt cttcatccaa tgaggaggta 3 00 

atgttcctca ctgtccaagt gaaaggaeca acccaagaat ttaagaagcg gaccacagtg 360 

atggttaaga acgaggacag tctggtcttt gtccagacag acaaatcaat ctacaaacca 420 

gggcagacag tgaaatttcg tgttgtctcc atggatgaaa actttcaccc cctgaatgag 4 80 

ttgattccac tagtatacat tcaggatccc aaaggaaatc gcatcgcaca atggcagagt 54 0 

ttccagttag agggtggcct caagcaattt tcttttcccc tctcatcaga gcccttccag 600 

ggctcctaca aggtggtggt acagaagaaa tcaggtggaa ggacagagca ccctttcacc 660 

gtggaggaat ttgttcttcc caagtttgaa gtacaagtaa cagtgccaaa gataatcacc 720 

atcttggaag aagagatgaa tgtatcagtg tgtggcctat acacatatgg gaagcctgtc 780 

cctggacatg tgactgtgag catttgcaga aagtatagtg acgcttccga ctgccacggt 840 

gaagattcac aggctttctg tgagaaattc agtggacagc taaacagcca tggctgcttc 900 

tatcagcaag taaaaaccaa ggtcttccag ctgaagagga aggagtatga aatgaaactt 960 

cacactgagg cccagatcca agaagaagga acagtggtgg aattgactgg aaggcagtcc 1020 

agtgaaatca caagaaccat aaccaaactc tcatttgtga aagtggactc acactttcga 1080 

cagggaattc ccttctttgg gcaggtgcgc ctagtagatg ggaaaggcgt ccctatacca 1140 

aataaagtca tattcatcag aggaaatgaa gcaaactatt actccaatgc taccacggat 1200 

gagcatggcc ttgtacagtt ctctatcaac accaccaacg ttatgggtac ctctcttact 1260 

gttagggtca attacaagga tcgtagtccc tgttacggct accagtgggt gtcagaagaa 1320 

cacgaagagg cacatcacac tgcttatctt gtgttctccc caagcaagag ctttgtccac 1380 

cttgagccca tgtctcatga actaccctgt ggccatactc agacagtcca ggcacattat 1440 

attctgaatg gaggcaccct gctggggctg aagaagctct ccttttatta tctgataatg 1500 

gcaaagggag gcattgtccg aactgggact catggactgc ttgtgaagca ggaagacatg 1560 

aagggccatt tttccatctc aatccctgtg aagtcagaca ttgctcctgt cgctcggttg 1620 

ctcatctatg ctgttttacc taccggggac gtgattgggg attctgcaaa atatgatgtt 1680 

gaaaattgtc tggccaacaa ggtggatttg agcttcagcc catcacaaag tctcccagcc 1740 

tcacacgccc acctgcgagt cacagcggct cctcagtccg tctgcgccct ccgtgctgtg 1800 

gaccaaagcg tgctgctcat gaagcctgat gctgagctct cggcgtcctc ggtttacaac 1860 

ctgctaccag aaaaggacct cactggcttc cctgggcctt tgaatgacca ggacgatgaa 1920 

gactgcatca atcgtcataa tgtctatatt aatggaatca catatactcc agtatcaagt 1980 

acaaatgaaa aggatatgta cagcttccta gaggacatgg gcttaaaggc attcaccaac 2040 

tcaaagattc gtaaacccaa aatgtgtcca cagcttcaac agtatgaaat gcatggacct 2100 
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gaaggtctac gtgtaggttt ttatgagtca gatgtaatgg gaagaggcca tgcacgcctg 2160 

gtgcatgttg aagagcctca cacggagacc gtacgaaagt acttccctga gacatggatc 2220 

tgggatttgg tggtggtaaa ctcagcaggg gtggctgagg taggagtaac agtccctgac 2280 

accatcaccg agtggaaggc aggggccttc tgcctgtctg aagatgctgg acttggtatc 2340 

tcttccactg cctctctccg agccttccag cccttctttg tggagcttac aatgccttac 2400 

tctgtgattc gtggagaggc cttcacactc aaggccacgg tcctaaacta ccttcccaaa 2460 

tgcatccggg tcagtgtgca gctggaagcc tctcccgcct tccttgctgt cccagtggag 2520 

aaggaacaag cgcctcactg catctgtgca aacgggcggc aaactgtgtc ctgggcagta 2580 

accccaaagt cattaggaaa tgtgaatttc actgtgagcg cagaggcact agagtctcaa 2640 

gagctgtgtg ggactgaggt gccttcagtt cctgaacacg gaaggaaaga cacagtcatc 2700 

aagcctctgt tggttgaacc tgaaggacta gagaaggaaa caacattcaa ctccctactt 2760 

tgtccatcag gtggtgaggt ttctgaagaa ttatccctga aactgccacc aaatgtggta 2820 

gaagaatctg cccgagcttc tgtctcagtt ttgggagaca tattaggctc tgccatgcaa 2880 

aacacacaaa atcttctcca gatgccctat ggctgtggag agcagaatat ggtcctcttt 2940 

gctcctaaca tctatgtact ggattatcta aatgaaacac agcagcttac tccagaggtc 3000 

aagtccaagg ccattggeta tctcaacact ggttaccaga gacagttgaa ctacaaacac 3060 

tatgatggct cctacagcac ctttggggag cgatatggca ggaaccaggg caacacctgg 3120 

ctcacagcct ttgttctgaa gacttttgcc caagctcgag cctacatctt catcgatgaa 3180 

gcacacatta cccaagccct catatggctc tcccagaggc agaaggacaa tggctgtttc 3240 

aggagctctg ggtcactgct caacaatgcc ataaagggag gagtagaaga tgaagtgacc 3300 

ctctcegcct atatcaccat cgcccttctg gagattcctc tcacagtcac tcaccctgtt 3360 

gtccgeaatg ccctgttttg cctggagtca gcctggaaga cagcacaaga aggggaccat 3420 

ggcagccatg tatataccaa agcactgctg gcctatgctt ttgcQCtggc aggtaaccag 3480 

gacaagagga aggaagtact caagtcactt aatgaggaag ctgtgaagaa agacaactct 3540 

gtccattggg agcgccctca gaaacccaag gcaccagtgg ggcattttta cgaaccccag 3600 

gctccctctg ctgaggtgga gatgacatcc tatgtgctcc tcgcttatct cacggcccag 3660 

ccagccccaa cctcggagga cctgacctct gcaaccaaca tcgtgaagtg gatcacgaag 3720 

cagcagaatg cccagggcgg tttctcctcc acccaggaca • cagtggtggc tctccatgct 3780 

ctgtccaaat atggagccgc cacatttacc aggactggga aggctgcaca ggtgactatc 3840 

cagtcttcag ggacattttc cagcaaattc caagtggaca acaacaatcg cctgttactg 3900 

cagcaggtct cattgccaga gctgcctggg gaatacagca tgaaagtgac aggagaagga 3960 

tgtgtctacc tccagacctc cttgaaatac aatattctcc cagaaaagga agagttcccc 4020 

tttgctttag gagtgcagac tctgcctcaa acttgtgatg aacccaaagc ccacaccagc 4080 

ttccaaatct ccctaagtgt cagttacaca gggagccgct ctgcctccaa catggcgatc 4140 

gttgatgtga agatggtctc tggcttcatt cccctgaagc caacagtgaa aatgcttgaa 4200 

agatctaacc atgtgagccg gacagaagtc agcageaacc atgtcttgat ttaccttgat 4260 

aaggtgtcaa atcagacact gagcttgttc ttcacggttc tgcaagatgt cccagtaaga 4320 

gatctcaaac cagccatagt gaaagtctat gattactacg agacggatga gtttgcaatc 4380 

gctgagtaca atgctccttg cagcaaagat cttggaaatg ct 4422 
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Asp Glu 


Asn 


Phe His Pro Leu 


Asn Glu 


145 










150 








155 


160 


Leu 


He 


Pro 


Leu 


Val 


Tyr 


He 


Gin Asp 


Pro 


Lys Gly Asn Arg 


He Ala 










165 








170 




175 


Gin 


Trp 


Gin 


Ser 


Phe 


Gin 


Leu 


Glu Gly 


Gly 


Leu Lys Gin Phe 


Ser Phe 








180 








185 




190 




Pro 


Leu 


Ser 


Ser 


Glu 


Pro 


Phe 


Gin Gly 


Ser 


Tyr Lys Val Val 


Val Gin 






195 










200 




205 




Lys 


Lys 


Ser 


Gly 


Gly 


Arg 


Thr 


Glu His 


Pro 


Phe Thr Val Glu 


Glu Phe 




210 










215 






220 




Val 


Leu 


Pro 


Lys 


Phe 


Glu 


Val 


Gin Val 


Thr 


Val Pro Lys He 


He Thr 


225 










230 








235 


240 


lie 


Leu 


Glu 


Glu 


Glu 


Met 


Asn 


Val Ser 


Val 


Cys Gly Leu Tyr 


Thr Tyr 










245 








250 




255 


Gly 


Lys 


Pro 


Val 


Pro 


Gly 


His 


Val Thr 


Val 


Ser lie Cys Arg 


Lys Tyr 

J 2 








260 








265 




270 




Ser 


Asp 


Ala 


Ser 


Asp 


Cys 


His 


Gly Glu 


Asp 


Ser Gin Ala Phe 


Cys Glu 






275 










280 




285 




Lys 


Phe 


Ser 


Gly 


Gin 


Leu 


Asn 


Ser His 


Gly 


Cys Phe Tyr Gin 


Gin Val 




290 










295 






300 




Lye 


Thr 


Lys 


Val 


Phe 


Gin 


Leu 


Lys Arg 


Lys 


Glu Tyr Glu Met 


Lys Leu 


305 










310 








315 


320 


His 


Thr 


Glu 


Ala 


Gin 


lie 


Gin 


Glu Glu 


Gly 


Thr Val Val Glu 


Leu Thr 










325 








330 




335 


Gly 


Arg 


Gin 


Ser 


Ser 


Glu 


He 


Thr Arg 


Thr 


lie Thr Lys Leu 


Ser Phe 








340 








345 




350 




Val 


Lys 


Val 


Asp 


Ser 


His 


Phe 


Arg Gin 


Gly 


He Pro Phe Phe 


Gly Gin 






355 










360 




365 




Val 


Arg 


Leu 


Val 


Asp 


Gly 


Lys 


Gly Val 


Pro 


He Pro Ash Lys 


Val He 




370 










375 






380 




Phe 


He 


Arg 


Gly 


Asn 


Glu 


Ala 


Asn Tyr 


Tyr 


Ser Asn Ala Thr 


Thr Asp 


385 










390 








395 


400 


Glu 


His 


Gly 


Leu 


Val 


Gin 


Phe 


Ser He 


Asn 


Thr Thr Asn Val 


Met Gly 










405 








410 




415 


Thr 


Ser 


Leu 


Thr 


Val 


Arg 


Val 


Asn Tyr 


Lys 


Asp Arg Ser Pro 


Cys Tyr 








420 








425 




430 




Gly 


Tyr 


Gin 


Trp 


Val 


Ser 


Glu 


Glu His 


Glu 


Glu Ala His His 


Thr Ala 






435 










440 




445 




Tyr 


Leu 


Val 


Phe 


Ser 


Pro 


Ser 


Lys Ser 


Phe 


Val His Leu Glu 


Pro Met 




450 










455 






460 




Ser 


His 


Glu 


Leu 


Pro 


Cys 


Gly 


His Thr 


Gin 


Thr Val Gin Ala 


His Tyr 


465 










470 








475 


480 


He 


Leu 


Asn 


Gly 


Gly 


Thr 


Leu 


Leu Gly 


Leu 


Lys Lys Leu Ser 


Phe Tyr 










485 








490 




495 


Tvr 


Leu 


He 


Met 


Ala 


Lys 


Glv 


Gly He 


Val 


Arg Thr Gly Thr 


His Gly 








500 








505 




510 




Leu 


Leu 


Val 


Lys 


Gin 


Glu 


Asp 


Met Lys 


Gly 


His Phe Ser He 


Ser He 






515 










520 




525 




Pro 


Val 


Lys 


Ser 


Asp 


He 


Ala 


Pro Val 


Ala 


Arg Leu Leu He 


Tyr Ala 




53 0 










535 






540 




Val 


Leu 


Pro 


Thr 


Gly 


Asp 


Val 


He Gly 


Asp 


Ser Ala Lys Tyr 


Asp Val 


545 










550 








555 


560 
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Glu 


Asn 


Cys 
* 


Leu Ala Asn Lys 


Val 


Asp 








565 






Ser 


Leu 


Pro 


Ala Ser His Ala 


His 


Leu 








580 




585 


Ser 


Val 


Cys 


Ala Leu Arg Ala 


Val 


Asp 






595 




600 




Pro 


Asp 


Ala 


Glu Leu Ser Ala 


Ser 


Ser 




610 




615 






IiVS 


Asp 


Leu 


Thr Gly Phe Pro 


Gly 


Pro 


625 






630 






Asp 


Cvs 


lie 


Asn Arg His Asn 


Val 


Tyr 

2 








645 






Pro 


Val 


Ser 


Ser Thr Asn Glu 


Lys 

2 


Asp 








660 




665 


Met 


Gly 


Leu 


Lys Ala Phe Thr 


Asn 


Ser 






675 




680 




Cvs 


Pro 


Gin 


Leu Gin Gin Tyr 


Glu 


Met 




690 




695 






Val 


Glv 


Phe 


Tyr Glu Ser Asp 


Val 


Met 


705 






710 






Val 


His 


Val 


Glu Glu Pro His 


Thr 


Glu 








725 






Glu 


Thr 


Tro 


lie Trp Asp Leu 


Val 


Val 








740 




745 


Glu 


Val 


Gly 


Val Thr Val Pro 


Asp 


Thr 






755 




760 




Ala 


Phe 


Cvs 


Leu Ser Glu Asp 


Ala 


Glv 




770 




775 






Ser 


Leu 


Arg 


Ala Phe Gin Pro 


Phe 


Phe 


785 






790 






Ser 


Val 


lie 


Arg Gly Glu Ala 


Phe 


Thr 








805 






Tvr 


Leu 


Pro 


Lys Cys lie Arg 


Val 


Ser 








820 




825 


Ala 


Phe 


Leu 


Ala Val Pro Val 


Glu 


Lys 






835 




840 




Cvs 


Ala 


Asn 


Gly Arg Gin Thr 


Val 


Ser 




850 




855 






Leu 


Gly 


Asn 


Val Asn Phe Thr 


Val 


Ser 


865 






870 






Glu 


Leu 


Cvs 


Gly Thr Glu Val 


Pro 


Ser 








885 






Asp 


Thr 


Val 


lie Lys Pro Leu 


Leu 


Val 








900 




905 


Glu 


Thr 


Thr 


Phe Asn Ser Leu 


Leu 


Cvs 
* 






915 




920 




Glu 


Glu 


Leu 


Ser Leu Lys Leu 


Pro 


Pro 




930 




935 






Arg 


Ala 


Ser 


Val Ser Val Leu 


Gly 


Asp 


945 






950 






Asn 


Thr 


Gin 


Asn Leu Leu Gin 


Met 


Pro 








965 






Met 


Val 


Leu 


Phe Ala Pro Asn 


lie 


Tyr 








980 




985 


Thr 


Gin 


Gin 


Leu Thr Pro Glu 


Val 


Lys 






995 




1000 


Asn 


Thr 


Gly 


Tyr Gin Arg Gin 


Leu 


Asn 




1010 


1015 
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Leu Ser 


Phe 


Ser 


Pro 


Ser 


Gin 


570 








575 




Arg Val 


Thr 


Ala 


Ala 


Pro 


Gin 








590 






Gin Ser 


Val 


Leu 


Leu 


Met 


Lys 






605 








Val Tyr 


Asn 


Leu 


Leu 


Pro 


Glu 




620 










Leu Asn 


Asp 


Gin 


Asp 


Asp Glu 


635 










640 


lie Asn 


Gly 


He 


Thr 


Tyr Thr 


650 








655 




Met Tyr 


Ser 


Phe 


Leu 


Glu Asp 








670 






Lys lie 


Arg 


Lys 

2 


Pro 


Lys 


Met 






685 








His Gly 


Pro 


Glu 


Gly 


Leu Arg 




700 










Glv Arg 


Glv 


His 


Ala 


Arg Leu 


715 










720 


Thr Val 


Arg 


Lvs 

-2 a 


Tvr 


Phe 


Pro 


730 








735 




Val Asn 


Ser 


Ala 


Glv 


Val 


Ala 








750 






He Thr 


Glu 


Trp 


LVS 


Ala Gly 






765 








Leu Gly 


He 


Ser 


Ser 


Thr 


Ala 




780 










Val Glu 


Leu 


Thr 


Met 


Pro 


Tyr 


795 










800 


Leu Lys 


Ala 


Thr 


Val 


Leu 


Asn 


810 








815 




Val Gin 


Leu 


Glu 


Ala 


Ser 


Pro 








830 






Glu Gin 


Ala 


Pro 


His 


Cys 


He 






845 








Trp Ala 


Val 


Thr 


Pro 


Lys 


Ser 




860 










Ala Glu 


Ala 


Leu 


Glu 


Ser 


Gin 


8-75 










880 


Val Pro 


Glu 


His 


Glv 


Arg 


Lys 


890 








895 




Glu Pro 


Glu 


Glv 


Leu 


Glu 


Lys 








910 






Pro Ser 


Glv 


Glv 


Glu 


Val 


Ser 






925 








Asn Val 


Val 


Glu 


Glu 


Ser 


Ala 




940 










He Leu 


Gly 

2 


Ser 


Ala 


Met 


Gin 


955 










960 


Tyr Gly 


Cys 


Gly 


Glu 


Gin 


Asn 


970 








975 




Val Leu 


Asp 


Tyr 


Leu 


Asn 


Glu 








990 






Ser Lys 


Ala 


He 


Gly 


Tyr 


Leu 






1005 






Tyr Lys 


His 


Tyr Asp 


Gly Ser 



1020 
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Tyr Ser Thr Phe Gly Glu Arg Tyr Gly Arg Asn Gin Gly Asn Thr Trp 
1025 1030 1035 1040 

Leu Thr Ala Phe Val Leu Lys Thr Phe Ala Gin Ala Arg Ala Tyr lie 

1045 1050 1055 

Phe He Asp Glu Ala His He Thr Gin Ala Leu He Trp Leu Ser Gin 

1060 1065 1070 

Arg Gin Lys Asp Asn Gly Cys Phe Arg Ser Ser Gly Ser Leu Leu Asn 

1075 1080 1085 

Asn Ala He Lys Gly Gly Val Glu Asp Glu Val Thr Leu Ser Ala Tyr 

1090 1095 1100 

He Thr He Ala Leu Leu Glu lie Pro Leu Thr Val Thr His Pro Val 
1105 1110 1115 1120 

Val Arg Asn Ala Leu Phe Cys Leu Glu Ser Ala Trp Lys Thr Ala Gin 

1125 1130 1135 

Glu Gly Asp His Gly Ser His Val Tyr Thr Lys Ala Leu Leu Ala Tyr 

1140 1145 1150 

Ala Phe Ala Leu Ala Gly Asn Gin Asp Lys Arg Lys Glu Val Leu Lys 

1155 ' 1160 1165 

Ser Leu Asn Glu Glu Ala Val Lys Lys Asp Asn Ser Val His Trp Glu 

1170 1175 1180 

Arg Pro Gin Lys Pro Lys Ala Pro Val Gly His Phe Tyr Glu Pro Gin 
1185 1190 1195 1200 

Ala Pro Ser Ala Glu Val Glu Met Thr Ser Tyr Val Leu Leu Ala Tyr 

- 1205 1210 1215 

Leu Thr Ala Gin Pro Ala Pro Thr Ser Glu Asp Leu Thr Ser Ala Thr 

1220 1225 1230 

Asn He Val Lys Trp He Thr Lys Gin Gin Asn Ala Gin Gly Gly Phe 

1235 1240 1245 

Ser Ser Thr Gin Asp Thr Val Val Ala Leu His Ala Leu Ser Lys Tyr 

1250 1255 1260 

Gly Ala Ala Thr Phe Thr Arg Thr Gly Lys Ala Ala Gin Val Thr He 
1265 1270 1275 1280 

Gin Ser Ser Gly Thr Phe Ser Ser Lys Phe Gin Val Asp Asn Asn Asn 

1285 1290 1295 

Arg Leu Leu Leu Gin Glii Val Ser Leu Pro Glu Leu Pro Gly Glu Tyr 

1300 1305 1310 

Ser Met Lys Val Thr Gly Glu Gly Cys Val Tyr Leu Gin Thr Ser Leu 

1315 1320 1325 

Lys Tyr Asn lie Leu Pro Glu Lys Glu Glu Phe Pro Phe Ala Leu Gly 

1330 1335 1340 

Val Gin Thr Leu Pro Gin Thr Cys Asp Glu Pro Lys Ala His Thr Ser 
1345 1350 1355 . 1360 

Phe Gin He Ser Leu Ser Val Ser Tyr Thr Gly Ser Arg Ser Ala Ser 

1365 1370 1375 

Asn Met Ala He Val Asp Val Lys Met Val Ser Gly Phe He Pro Leu 

1380 1385 1390 

Lys Pro Thr Val Lys Met Leu Glu Arg Ser Asn His Val Ser Arg Thr 

1395 1400 1405 

Glu Val Ser Ser Asn His Val Leu lie Tyr Leu Asp Lys Val Ser Asn 

1410 1415 ~ 1420 

Gin Thr Leu Ser Leu Phe Phe Thr Val Leu Gin Asp Val Pro Val Arg 
1425 1430 1435 1440 

Asp Leu Lys Pro Ala He Val Lys Val Tyr Asp Tyr Tyr Glu Thr Asp 

1445 1450 1455 

Glu Phe Ala He Ala Glu Tyr Asn Ala Pro Cys Ser Lys Asp Leu Gly 
1460 1465 1470 

Asn Ala 
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<210> 6 
<211> 14896 
<212> DNA 

<213> Homo sapiens 
<400> 6 

cagcggtgcg agctccaggc ccatgcaetg aggaggcgga aacaagggga gcccccagag 60 

ctccatcaag ccccctccaa aggctcccct acccggtcca cgccccccac cccccctccc 120 

cgcctcctcc caattgtgca tttttgcagc cggaggcggc tccgagatgg ggctgtgagc 180 

ttcgcccggg gagggggaaa gagcagcgag gagtgaagcg ggggggtggg gtgaagggtt 240 

tggatttcgg ggcagggggc gcacccccgt cagcaggccc tccccaaggg gctcggaact 300 

ctacctcttc acccacgccc ctggtgcgct ttgccgaagg aaagaataag aacagagaag 360 

gaggaggggg aaaggaggaa aagggggacc ccccaactgg ggggggtgaa ggagagaagt 420 

agcaggacca gaggggaagg ggctgctgct tgcatcagcc cacaccatgc tgaccccgcc 480 

gttgctcctg ctgctgcccc tgctctcagc tctggtcgcg gcggctatcg acgcccctaa 540 

gacttgcagc cccaagcagt ttgcctgcag agatcaaata acctgtatct caaagggctg 600 

gcggtgcgac ggtgagaggg actgcccaga cggatctgac gaggcccctg agatttgtcc 660 

acagagtaag gcccagcgat gccagccaaa cgagcataac tgcctgggta ctgagctgtg 720 

tgttcccatg tcccgcctct gcaatggggt ccaggactgc atggacggct cagatgaggg 780 

gccccactgc cgagagctcc aaggcaactg ctctcgcctg ggctgccagc accattgtgt 840 

ccccacactc gatgggccca cctgctactg caacagcagc tttcagcttc aggcagatgg 900 

caagacctgc aaagattttg atgagtgctc agtgtacggc acctgcagcc agctatgcac 960 

caacacagac ggctccttca tatgtggctg tgttgaagga tacctcctgc agccggataa 1020 

ccgctcctgc aaggccaaga acgagccagt agaccggccc cctgtgctgt tgatagceaa 1080 

ctcccagaac atcttggcca cgtacctgag tggggcccag gtgtctacca tcacacctac 1140 

gagcacgcgg cagaccacag ccatggactt cagctatgcc aacgagaccg tatgctgggt 1200 

gcatgttggg gacagtgctg ctcagacgca gctcaagtgt gcccgcatgc ctggcctaaa 1260 

gggcttcgtg gatgagcaca ccatqaacat ctccctcagt ctgcaccacg tggaacagat 1320 

ggccatcgac tggctgacag gcaacttcta ctttgtggat gacatcgatg ataggatctt 1380 

tgtctgcaac agaaatgggg acacatgtgt cacattgcta gacctggaac tctacaaccc 1440 

caagggcatt gccctggacc ctgccatggg gaaggtgttt ttcactgact atgggcagat 1500 

cccaaaggtg gaacgctgtg acatggatgg gcagaaccgc accaagctcg tcgacagcaa 1560 

gattgtgttt cctcatggca tcacgctgga cctggtcagfc cgccttgtct actgggcaga 1620 

tgcctatctg gactatattg aagtggtgga ctatgagggc aagggccgcc agaccatcafc 1680 

ccagggcatc ctgattgagc acctgtacgg cctgactgtg tttgagaatt atctctatgc 1740 

caccaactcg gacaatgcca atgcccagca gaagacgagt gtgatccgtg tgaaccgctt 1800 

taacagcacc gagtaccagg ttgtcacccg ggtggacaag ggtggtgccc tccacatcta 1860 

ccaccagagg cgtcagcccc gagtgaggag ccafcgcctgt gaaaacgacc agtatgggaa 1920 

gccgggtggc tgctctgaca tctgcctgct ggceaacagc cacaaggcgc ggacctgccg 1980 

ctgccgttcc ggcttcagcc tgggcagtga cgggaagtca tgcaagaagc cggagcatga 2040 

gctgttcctc gtgtatggca agggccggcc aggcatcatc cggggcatgg atatgggggc 2100 

caaggtcccg gatgagcaca tgatccccat tgaaaacctc atgaaccccc gagccctgga 2160 

cttccacgct gagaccggct tcatctactt tgccgacacc accagctacc tcattggccg 2220 

ccagaagatt gatggcactg agcgggagac catcctgaag gacggcatcc acaatgtgga 2280 

SSgtgtggcc gtggactgga tgggagacaa tctgtactgg acggacgatg ggcccaaaaa 2340 

gacaatcagc gtggccaggc tggagaaagc tgctcagacc cgcaagactt taatcgaggg 2400 

caaaatgaca caccccaggg ctattgtggt ggatccactc aatgggtgga tgtactggac 2460 

agactgggag gaggacccca aggacagtcg gcgtgggcgg ctggagaggg cgtggatgga 2520 

tggctcacac cgagacatct ttgtcacctc caagacagtg ctttggccca atgggctaag 2580 

cctggacatc ccggctgggc gcctctactg ggtggatgcc ttctacgacc gcatcgagac 2640 

gatactgctc aatggcacag accggaagat tgtgtatgaa ggtcctgagc tgaaccacgc 2700 

ctttggcctg tgtcaccatg gcaactacct cttctggact gagtatcgga gtggcagtgt 2760 

ctaccgcttg gaacggggtg taggaggcgc accccccact gtgacccttc tgcgcagtga 2820 

gcggcccccc atctttgaga tccgaatgta tgatgcccag cagcagcaag ttggcaccaa 2880 

caaatgccgg gtgaacaatg gcggctgcag cagcctgtgc ttggccaccc ctgggagccg 2 940 

ccagtgcgcc tgtgctgagg accaggtgtt ggacgcagac ggcgtcactt gcttggcgaa 3000 

cccatcctac gtgcctccac cccagtgcca gccaggcgag tttgcctgtg ccaacagccg 3060 

ctgcatccag gagcgctgga agtgtgacgg agacaacgat tgcctggaca acagtgatga 3120 
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ggccccagcc ctctgccatc agcacacctg cccctcggac cgattcaagt gcgagaacaa 3180 

ccggtgcatc cccaaccgct ggctctgcga cggggacaat gactgtggga acagtgaaga 3240 

tgagtccaat gccacttgtt cagcccgcac ctgccccccc aaccagttct cctgtgccag 3300 

tggccgctgc atccccatct cctggacgtg tgatctggat gacgactgtg gggaccgctc 3360 

tgatgagtct gcttcgtgtg cctatcccac ctgcttcccc ctgactcagt ttacctgcaa 3420 

caatggcaga tgtatcaaca tcaactggag atgcgacaat gacaatgact gtggggacaa 3480 

cagtgacgaa gccggctgca gccactcctg ttctagcacc cagttcaagt gcaacagcgg 3540 

gcgttgcatc cccgagcact ggacctgcga tggggacaat gactgcggag actacagtga 3600 

tgagacacac gccaactgca ccaaccaggc cacgaggccc cctggtggct gccacactga 3660 

tgagttccag tgccggctgg atggactatg catccccctg cggtggcgct gcgatgggga 3720 

cactgactgc atggactcca gcgatgagaa gagctgtgag ggagtgaccc acgtctgcga 3780 

tcccagtgtc aagtttggct gcaaggactc agctcggtgc atcagcaaag cgtgggtgtg 3840 

tgatggcgac aatgactgtg aggataactc ggacgaggag aactgcgagt ccctggcctg 3900 

caggccaccc tcgcaccctt gtgccaacaa cacctcagtc tgcctgcccc ctgacaagct 3960 

gtgtgatggc aacgacgact gtggcgacgg ctcagatgag ggcgagctct gcgaccagtg 4020 

ctctctgaat aacggtggct gcagccacaa ctgctcagtg gcacctggcg aaggcattgt 4080 

gtgttcctgc cctctgggca tggagctggg gcccgacaac cacacctgcc agatccagag 4140 

ctactgtgcc aagcatctca aatgcagcca aaagtgcgac cagaacaagt tcagcgtgaa 4200 

gtgctcctgc tacgagggct gggtcctgga acctgacggc gagagctgcc gcagcctgga 4260 

ccccttcaag ccgttcatca ttttctccaa ccgccatgaa atccggcgca tcgatcttca 4320 

caaaggagac tacagcgtcc tggtgcccgg cctgcgcaac accatcgccc tggacttcca 43 80 

cctcagccag agcgccctct actggaccga cgtggtggag gacaagatct accgcgggaa 4440 

gctgctggac aacggagccc tgactagttt cgaggtggtg attcagtatg gcctggccac 4500 

acccgagggc ctggctgtag actggattgc aggcaacatc tactgggtgg agagtaacct 4560 

ggatcagatc gaggtggcca agctggatgg gaccctccgg accaccc tgc tggccggtga 4620 

cattgagcac ccaagggcaa tcgcactgga tccccgggat gggatcctgt tttggacaga 4680 

ctgggatgcc agcctgcccc gcattgaggc agcctccatg agtggggctg ggcgccgcac 4740 

cgtgcaccgg gagaccggct ctgggggctg gcccaacggg ctcaccgtgg actacctgga 4800 

gaagcgcatc ctttggattg acgccaggtc agatgccatt tactcagccc gttacgacgg 4860 

ctctggccac atggaggtgc ttcggggaca cgagttcctg tcgcacccgt ttgcagtgac 4920 

gctgtacggg ggggaggtct actggactga ctggcgaaca aacacactgg ctaaggccaa 4980 

caagtggacc ggccacaatg tcaccgtggt acagaggacc aacacccagc cctttgacct 5040 

gcaggtgtac cacccctccc gccagcccat ggctcccaat ccctgtgagg ccaatggggg 5100 

ccagggcccc tgctcccacc tgtgtctcat caactacaac cggaccgtgt cctgcgcctg 5160 

cccccacctc atgaagctcc acaaggacaa caccacctgc tatgagttta agaagttcct 5220 

gctgtacgca cgtcagatgg agatccgagg tgtggacctg gatgctccct actacaacta 5280 

catcatctcc ttcacggtgc ccgacatcga caacgtcaca gtgctagact acgatgcccg 5340 

cgagcagcgt gtgtactggt ctgacgtgcg gacacaggcc atcaagcggg ccttcatcaa 5400 

cggcacaggc gtggagacag tcgtctctgc agacttgcca aatgcccacg ggctggctgt 5460 

ggactgggtc tcccgaaacc tgttctggac aagctatgac accaataaga agcagatcaa 5520 

tgtggcccgg ctggatggct ccttcaagaa cgcagtggtg cagggcctgg agcagcccca 5580 

tggccttgtc gtccaccctc tgcgtgggaa gctctactgg accgatggtg acaacatcag 5640 

catggccaac atggatggca gcaatcgcac cctgctcttc agtggccaga agggccccgt 5700 

gggcctggct attgacttcc ctgaaagcaa actctactgg atcagctccg ggaaccatac 5760 

catcaaccgc tgcaacctgg atgggagtgg gctggaggtc atcgatgcca tgcggagcca 5820 

gctgggcaag gccaccgccc tggccatcat gggggacaag ctgtggtggg ctgatcaggt 5880 

gtcggaaaag atgggcacat gcagcaaggc tgacggctcg ggctccgtgg tccttcggaa 5940 

cagcaccacc ctggtgatgc acatgaaggt ctatgacgag agcatccagc tggaccataa 6000 

gggcaccaac ccctgcagtg tcaacaacgg tgactgctcc cagctctgcc tgcccacgtc 6060 

agagacgacc cgctcctgca tgtgcacagc cggctatagc ctccggagtg gccagcaggc 6120 

ctgcgagggc gtaggttcct ttctcctgta ctctgtgcat gagggaatca ggggaattcc 6180 

cctggatccc aatgacaagt cagatgccct ggtcccagtg tccgggacct cgctggctgt 6240 

cggcatcgac ttccacgctg aaaatgacac catctactgg gtggacatgg gcctgagcac 6300 

gatcagccgg gccaagcggg accagacgtg gcgtgaagac gtggtgacca atggcattgg 6360 

ccgtgtggag ggcattgcag tggactggat cgcaggcaac atctactgga cagaccaggg 6420 

ctttgatgtc atcgaggtcg cccggctcaa tggctccttc cgctacgtgg tgatctccca 6480 

gggtctagac aagccccggg ccatcaccgt ccacccggag aaagggtact tgttctggac 6540 

tgagtggggt cagtatccgc gtattgagcg gtctcggcta gatggcacgg agcgtgtggt 6600 



23 



WO 01/92474 



PCT/US01/18041 



gctggtcaac gtcagcatca gctggcccaa cggcatctca gtggactacc aggatgggaa 6660 

gctgtactgg tgcgatgcac ggacagacaa gattgaacgg atcgacctgg agacaggtga 6720 

gaaccgcgag gtggttctgt ccagcaacaa catggacatg ttttcagtgt ctgtgtttga 6780 

ggatttcatc tactggagtg acaggactca tgccaacggc tctatcaagc gcgggagcaa 6840 

agacaatgcc acagactccg tgcccctgcg aaccggcatc ggcgtccagc ttaaagacat 6900 

caaagtcttc aaccgggacc ggcagaaagg caccaacgtg tgcgcggtgg ccaatggcgg 6960 

gtgccagcag ctgtgcctgt accggggccg tgggcagcgg gcctgcgcct gtgcccacgg 7020 

gatgctggct gaagacggag catcgtgccg cgagtatgcc ggctacctgc tctactcaga 7080 

gcgcaccatt ctcaagagta tccacctgtc ggatgagcgc aacctcaatg cgcccgtgca 7140 

gcccttcgag gaccctgagc acatgaagaa cgtcatcgcc ctggcctttg actaccgggc 7200 

aggcacctct ccgggcaccc ccaatcgcat cttcttcagc gacatccact ttgggaacat 7260 

ccaacagatc aacgacgatg gctccaggag gatcaccatt gtggaaaacg tgggctccgt 7320 

ggaaggcctg gcctatcacc gtggctggga cactctctat tggacaagct acacgacatc 73 80 

caccatcacg cgccacacag tggaccagac ccgcccaggg gccttcgagc gtgagaccgt 7440 

catcactatg tctggagatg accacccacg ggccttcgtt ttggacgagt gccagaacct 7500 

catgttctgg accaactgga atgagcagca tcccagcatc atgcgggcgg cgctctcggg 7560 

agccaatgtc ctgaccctta tcgagaagga catccgtacc cccaatggcc tggccatcga 7620 

ccaccgtgcc gagaagctct acttctctga cgccaccctg gacaagatcg agcggtgcga 7680 

gtatgacggc tcccaccgct atgtgatcct aaagtcagag cctgtccacc ccttcgggct 7740 

ggccgtgtat ggggagcaca ttttctggac tgactgggtg cggcgggcag tgcagcgggc 7800 

caacaagcac gtgggcagca acatgaagct gctgcgcgtg gacatccccc agcagcccat 7860 

gggcatcatc gccgtggcca acgacaccaa cagctgtgaa ctctctccat gccgaatcaa 7920 

caacggtggc tgccaggacc tgtgtctgct cactcaccag ggccatgtca actgctcatg 7980 

ccgagggggc cgaatcctcc aggatgacct cacctgccga gcggtgaatt cctcttgccg 8040 

agcacaagat gagtttgagt gtgccaatgg cgagtgcatc aacttcagcc tgacctgcga 8100 

cggcgtcccc cactgcaagg acaagtccga tgagaagcca tcctactgca actcccgccg 8160 

ctgcaagaag actttccggc agtgcagcaa tgggcgctgt gtgtccaaca tgctgtggtg 8220 

caacggggcc gacgactgtg gggatggctc tgacgagatc ccttgcaaca agacagcctg 8280 

tggtgtgggc gagttccgct gccgggacgg gacctgcatc gggaactcca gccgctgcaa 8340 

ccagtttgtg gattgtgagg acgcctcaga tgagatgaac tgcagtgcca ccgactgcag 8400 

cagctacttc cgcctgggcg tgaagggcgt gctcttccag ccctgcgagc ggacctcact 8460 

ctgctacgca cccagctggg tgtgtgatgg cgccaatgac tgtggggact acagtgatga 8520 

gcgcgactgc ccaggtgtga aacgccccag atgccctctg aattacttcg cctgccctag 8580 

tgggcgctgc atccccatga gctggacgtg tgacaaagag gatgactgtg aacatggcga 8640 

ggacgagacc cactgcaaca agttctgctc agaggcccag tttgagtgcc agaaccatcg 8700 

ctgcatctcc aagcagtggc tgtgtgacgg cagcgatgac tgtggggatg gctcagacga 8760 

ggctgctcac tgtgaaggca agacgtgcgg cccctcctcc ttctcctgcc ctggcaccca 8820 

cgtgtgcgtc cccgagcgct ggctctgtga cggtgacaaa gactgtgctg atggtgcaga 8880 

cgagagcatc gcagctggtt gcttgtacaa cagcacttgt gacgaccgtg agttcatgtg 8940 

ccagaaccgc cagtgcatcc ccaagcactt cgtgtgtgac cacgaccgtg actgtgcaga 9000 

tggctctgat gagtcccccg agtgtgagta cccgacctgc ggccccagtg agttccgctg 9060 

tgccaatggg cgctgtctga gctcccgcca gtgggagtgt gatggcgaga atgactgcca 9120 

cgaccagagt gacgaggctc ccaagaaccc acactgcacc agcccagagc acaagtgcaa 9180 

tgcctcgtca cagttcctgt gcagcagtgg gcgctgtgtg gctgaggcac tgctctgcaa 9240 

cggccaggat gactgtggcg acagctcgga cgagcgtggc tgccacatca atgagtgtct 9300 

cagccgcaag ctcagtggct gcagccagga ctgtgaggac ctcaagatcg gcttcaagtg 9360 

ccgctgtcgc cctggcttcc ggctgaagga tgacggccgg acgtgtgctg atgtggacga 942 0 

gtgcagcacc accttcccct gcagccagcg ctgcatcaac acccatggca gctataagtg 9480 

tctgtgtgtg gagggctatg caccccgcgg cggcgacccc cacagctgca aggctgtgac 9540 

tgacgaggaa ccgtttctga tcttcgccaa ccggtactac ctgcgcaagc tcaacctgga 9600 

cgggtccaac tacacgttac ttaagcaggg cctgaacaac. gccgttgcct tggattttga 9660 

ctaccgagag cagatgatct actggacaga tgtgaccacc cagggcagca tgatccgaag 9720 

gatgcacctt aacgggagca atgtgcaggt cctacaccgt acaggcctca gcaaccccga 9780 

tgggctggct gtggactggg tgggtggcaa cctgtactgg tgcgacaaag gccgggacac 9840 

catcgaggtg tccaagctca atggggccta tcggacggtg ctggtcagct ctggcctccg 9900 

tgagcccagg gctctggtgg tggatgtgca gaatgggtac ctgtactgga cagactgggg 9960 

tgaccattca ctgatcggcc gcatcggcat ggatgggtcc agccgcagcg tcatcgtgga 10020 

caccaagatc acatggccca atggcctgac gctggactat gtcactgagc gcatctactg 10080 
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ggccgacgcc cgcgaggact acattgaatt tgccagcctg gatggctcca atcgccacgt 10140 

tgtgctgagc caggacatcc cgcacatctt tgcactgacc ctgtttgagg actacgtcta 10200 

ctggaccgac tgggaaacaa agtccattaa ccgagcccac aagaccacgg gcaccaacaa 10260 

aacgctcctc atcagcacgc tgcaccggcc catggacctg catgtcttcc atgccctgcg 10320 

ccagccagac gtgcccaatc acccctgcaa ggtcaacaat ggtggctgca gcaacctgtg 10380 

cctgctgtcc cccgggggag ggcacaaatg tgcctgcccc accaacttct acctgggcag 1044 0 

cgatgggcgc acctgtgtgt ccaactgcac ggctagccag tttgtatgca agaacgacaa 10500 

gtgcatcccc ttctggtgga agtgtgacac cgaggacgac tgcggggacc actcagacga 10560 

gcccccggac tgccctgagt tcaagtgccg gcccggacag ttccagtgct ccacaggtat 10620 

ctgcacaaac cctgccttca tctgcgatgg cgacaatgac tgccaggaca acagtgacga 10680 

ggccaactgt gacatccacg tctgcttgcc cagtcagttc aaatgcacca acaccaaccg 10740 

ctgtattccc ggcatcttcc gctgcaatgg gcaggacaac tgcggagatg gggaggatga 10800 

gagggactgc cccgaggtga cctgcgcccc caaccagttc cagtgctcca ttaccaaacg 10860 

gtgcatcccc cgggtctggg tctgcgaccg ggacaatgac tgtgtggatg gcagtgatga 10920 

gcccgccaac tgcacccaga tgacctgtgg tgtggacgag ttccgctgca aggattcggg 10980 

ccgctgcatc ccagcgcgtt ggaagtgtga cggagaggat gactgtgggg atggctcgga 11040 

tgagcccaag gaagagtgtg atgaacgcac ctgtgagcca taccagttcc gctgcaagaa 11100 

caaccgctgc gtgcccggcc gctggcagtg cgactacgac aacgattgcg gtgacaactc 11160 

cgatgaagag agctgcaccc ctcggccctg ctccgagagt gagttctcct gtgccaacgg 11220 

ccgctgcatc gcggggcgct ggaaatgcga tggagaccac gactgcgcgg acggctcgga 11280 

cgagaaagac tgcacccccc gctgtgacat ggaccagttc cagtgcaaga gcggccactg 11340 

catccccctg cgctggcgct gtgacgcaga cgccgactgc atggacggca gcgacgagga 11400 

ggcctgcggc actggcgtgc ggacctgccc cctggacgag ttccagtgca acaacacctt 11460 

gtgcaagccg ctggcctgga agtgcgatgg cgaggatgac tgtggggaca actcagatga 11520 

gaaccccgag gagtgtgccc ggttcgtgtg ccctcccaac cggcccttcc gttgcaagaa 11580 

tgaccgcgtc tgtctgtgga tcgggcgcca atgcgatggc acggacaact gtggggatgg 11640 

gactgatgaa gaggactgtg agccccccac agcccacacc acccactgca aagacaagaa 11700 

ggagtttctg tgccggaacc agcgctgcct ctcctcctcc ctgcgctgca acatgttcga 11760 

tgactgcggg gacggctctg acgaggagga ctgcagcatc gaccccaagc tgaccagctg 11820 

cgccaccaat gccagcatct gtggggacga ggcacgctgc gtgcgcaccg agaaagcggc 11880 

ctactgtgcc tgccgctcgg gcttccacac cgtgcccggc cagcccggat gccaagacat 11940 

caacgagtgc ctgcgcttcg gcacctgctc ccagctctgc aacaacacca agggcggcca 12000 

cctctgcagc tgcgctcgga acttcatgaa gacgcacaac acctgcaagg ccgaaggctc 12060 

tgagtaccag gtcctgtaca tcgctgatga caatgagatc cgcagcctgt tccccggcca 12120 

cccccattcg gcttacgagc aggcattcca gggtgacgag agtgtccgca ttgatgctat 12180 

ggatgtccat gtcaaggctg gccgtgtcta ttggaccaac tggcacacgg gcaccatctc 12240 

ctaccgcagc ctgccacctg ctgcgcctcc taccacttcc aaccgccacc ggcgacagat 12300 

tgaccggggt gtcacccacc tcaacatttc agggctgaag atgcccagag gcatcgccat 12360 

cgactgggtg gccggaaacg tgtactggac cgactcgggc cgagatgtga ttgaggtggc 12420 

gcagatgaag ggcgagaacc gcaagacgct catctcgggc atgattgacg agccccacgc 12480 

cattgtggtg gacccactga gggggaccat gtactggtca gactggggca accaccccaa 12540 

gattgagacg gcagcgatgg atgggacgct tcgggagaca ctggtgcagg acaacattca 12600 

gtggcccaca ggcctggccg tggattatca caatgagcgg ctgtactggg cagacgccaa 12660 

gctttcagtc atcggcagca tccggctcaa tggcacggac cccattgtgg ctgctgacag 12720 

caaacgaggc ctaagtcacc ccttcagcat cgacgtcttt gaggattaca tctatggtgt 12780 

cacctacatc aataatcgtg tcttcaagat ccataagttt ggccacagcc ccttggtcaa 12840 

cctgacaggg ggcctgagcc acgcctctga cgtggtcctt taccatcagc acaagcagcc 12900 

cgaagtgacc aacccatgtg accgcaagaa atgcgagtgg ctctgcctgc tgagccccag 12960 

tgggcctgtc tgcacctgtc ccaatgggaa gcggctggac aacggcacat gcgtgcctgt 13 020 

gccctctcca acgccccccc cagatgctcc ccggcctgga acctgtaacc tgcagtgctt 1308 0 

caacggtggc agctgtttcc tcaatgcacg gaggcagccc aagtgccgct gccaaccccg 1314 0 

ctacacgggt gacaagtgtg aactggacca gtgctgggag cactgtcgca atgggggcac 13200 

ctgtgctgcc tccccctctg gcatgcccac gtgccggtgc cccacgggct tcacgggccc 13260 

caaatgcacc cagcaggtgt gtgcgggcta ctgtgccaac aacagcacct gcactgtcaa 13320 

ccagggcaac cagccccagt gccgatgcct acccggcttc ctgggcgacc gctgccagta 13380 

ccggcagtgc tctggctact gtgagaactt tggcacatgc cagatggctg ctgatggctc 13440 

ccgacaatgc cgctgcactg cctactttga gggatcgagg tgtgaggtga acaagtgcag 13500 

ccgctgtctc gaaggggcct gtgtggtcaa caagcagagt ggggatgtca cctgcaactg 13560 
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cacggatggc cgggtggccc ccagctgtct gacctgcgtc ggccactgca gcaatggcgg . 13620 

ctcctgtacc atgaacagca aaatgatgcc tgagtgccag tgcccacccc acatgacagg 13680 

gccccggtgt gaggagcacg tcttcagcca gcagcagcca ggacatatag cctccatcct 13740 

aatccctctg ctgttgctgc tgctgctggt tctggtggcc ggagtggtat tctggtataa 13800 

gcggcgagtc caaggggeta agggcttcca gcaccaacgg atgaccaacg gggccatgaa 13860 

cgtggagatt ggaaacccca cctacaagat gtacgaaggc ggagagcctg atgatgtggg 13920 

aggcctactg gacgctgact ttgccctgga ccctgacaag cccaccaact tcaccaaccc 13980 

cgtgtatgcc acactctaca tggggggcca tggcagtcgc cactccctgg ccagcacgga 14040 

cgagaagcga gaactcctgg gccggggccc tgaggacgag ataggggacc ccttggcata 14100 

gggccctgcc ccgtcggact gcccccagaa agcctcctgc cccctgccgg tgaagtcctt 14160 

cagtgagccc ctccccagcc agcccttccc tggccccgcc ggatgtataa atgtaaaaat 14220 

gaaggaatta cattttatat gtgagcgagc aagccggcaa gcgagcacag tattatttct 14280 

ccatcccctc cctgcctgct ccttggcacc cccatgctgc cttcagggag acaggcaggg 14340 

agggcttggg gctgcacctc ctaccctccc accagaacgc accceactgg gagagctggt 14400 

ggtgcagcct tcccctccct gtataagaca ctttgccaag gctctcccct ctcgccccat 14460 

ccctgcttgc ccgctcccac agcttcctga gggctaattc tgggaaggga gagttctttg 14520 

ctgcccctgt ctggaagacg tggctctggg tgaggtaggc gggaaaggat ggagtgtttt 14580 

agttcttggg ggaggccacc ccaaacccca gccccaactc caggggcacc tatgagatgg 14640 

ccatgctcaa ccccqctccc agacaggccc tccctgtctc cagggceccc accgaggttc 14700 

ccagggctgg agacttcctc tggtaaacat tcctccagcc tcccctcccc tggggacgcc 14760 

aaggaggtgg gccacaccca ggaagggaaa gcgggcagcc ccgttttggg gacgtgaacg 14820 

ttttaataat ttttgctgaa ttctttacaa ctaaataaca cagatattct tataaataaa 14880 

attgtaaaaa aaaaaa 14896 
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